Two methods for constructing solutions to the problem of minimizing an integral in a 
certain class of arcs joining a pair of fixed points are proposed. One of these procedures is 
a generalization of Newton's method, while the other is a "gradient" method. Conditions 
for convergence to a strong relative minimum are given in both cases. 



1. Introduction 

In this paper we consider methods for effectively constructing solutions to the problem of minimizing an integral

$$I(y)=\int_a^b f(x,y,y')\,dx \tag{1}$$

in a certain class of arcs $y_i(x)$ $(a \le x \le b;\ i=1,2,\dots,n)$ joining the two fixed points $(x=a,\ y(a)=\rho)$ and $(x=b,\ y(b)=\sigma)$. As is well known, the minimum must necessarily satisfy the Euler equations

$$\frac{d}{dx} f_{y_i'}(x,y,y') = f_{y_i}(x,y,y') \qquad (i=1,2,\dots,n).$$



Thus in any particular case the solution of these 
equations becomes of prime importance. The two 
methods to be proposed below enable us to compute 
arcs that satisfy the Euler equations subject to 
boundary conditions at two distinct points. For 
both methods an initial estimate of the solution is 
made. From this estimate a new estimate is ob- 
tained in a definite way, and so forth. These pro- 
cedures have the advantage of giving estimates that 
satisfy the boundary conditions at each step, and 
their iterative nature makes them adaptable to auto- 
matic computation. 

The first method of obtaining new estimates that 
will be discussed is a generalization of Newton's 
method for functions of a single real variable. This 
method is described in section 2 where it is actually 
applied to a more general class of systems of differen- 
tial equations than the Euler equations, namely 
those systems of the form

$$\frac{d}{dx}\, g_i(x,y,y') = h_i(x,y,y') \qquad (i=1,2,\dots,n).$$



To construct the convergence proof, we make the 
class of functions under consideration into a Banach 
space and regard the system of differential equations 
as an operator T on the space into itself. The necessary 
Banach space tools are developed in sections 3, 
4, and 5. Sections 6, 7, and 8 are devoted to an 
analysis of the properties of T and of its first varia- 
tion. The results obtained in these sections are used 
in section 9 to show that under simple conditions on 
the initial estimate the sequence of estimates derived 
from the first method will converge quadratically to 
an admissible function that satisfies the system of 
differential equations and the two-point boundary 
conditions. In fact, it is demonstrated in section 9 
that convergence will occur if the initial estimate has 
nonconjugate end points (see definition 5) and is con- 
tained in some neighborhood on which the norm of T 
is bounded by a suitable constant. These conditions 
are shown to be satisfied if a solution exists and has 
nonconjugate end points. In this case there will 
exist a neighborhood of the solution such that the 
sequence of estimates will converge for any initial 
estimate chosen from this neighborhood. In the 
special case of the Euler equations, considered in 
section 11, it is shown that the limit function not only 
satisfies the differential equations but under the 
proper conditions also affords a strong relative minimum 
to $I(y)$. Section 10 concerns itself with an 
interesting lemma about conjugate points, that 
proves useful in section 11. 

While not having as large a range of applicability 
as the first method, the second of the two methods of 
choosing new estimates that will be discussed has the 
advantage of being simpler to handle at each itera- 
tion. This method is described in section 12, where 
it is shown to be particularly useful for the solution 
of the Euler equations arising from an integral whose 
integrand is quadratic in $y$ and $y'$. In this case the 
integral itself, instead of the differential equations, is 
considered; and it is shown that under suitable hy- 
pothesis the sequence of estimates converges to an 
arc that affords a strong relative minimum to $I(y)$. 
In fact, convergence will occur if $I$ has a lower 
bound on the class of admissible functions and the 
second variation is positive. 

Neither of the methods to be described below is 
presented as a quick computational procedure for a 
desk computer. However, they are processes that 
might well be carried out on automatic computing 
machinery. Numerical examples are to be found 
in section 13. 

2. An Iterative Method for Solving Certain 
Systems of Second Order Differential 
Equations 

In the following pages we shall be concerned with 
constructing solutions for systems of the form

$$\frac{d}{dx}\, g_i(x,y,y') = h_i(x,y,y') \quad (i=1,2,\dots,n), \qquad y_i(a)=\rho_i, \quad y_i(b)=\sigma_i. \tag{2}$$

Here $y=(y_1,y_2,\dots,y_n)$ denotes an arc defined on the interval $a \le x \le b$, and $\rho$, $\sigma$ denote arbitrary constant vectors. By formal integration of (2) it is found that an arc $y$ that satisfies these equations must also satisfy

$$T_i(x;y) = \int_a^x \Big[ g_i(t,y,y') - \int_a^t h_i(s,y,y')\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ g_i(t,y,y') - \int_a^t h_i(s,y,y')\,ds \Big] dt = 0, \tag{3}$$

$$y_i(a)=\rho_i, \qquad y_i(b)=\sigma_i \qquad (i=1,2,\dots,n).$$

Conversely, any arc without corners that satisfies (3) must also satisfy (2). For reasons that will be clarified below, we prefer to make use of the differential equations (2) in their integral form (3). Accordingly, let us examine some of the properties of the function $T_i(x;y)$. That

$$T_i(a;y) = T_i(b;y) = 0 \qquad (i=1,2,\dots,n) \tag{4}$$

independent of the arc $y$ is immediately apparent. 

One recalls that a function of one or more variables is said to be of class $C^n$, $n \ge 0$, in its arguments on its domain of definition if it is continuous on that domain together with all of its (partial) derivatives up to and including those of order $n$. An arc $y=(y_1,y_2,\dots,y_n)$ is of class $C^n$ if all of its components $y_i$ have this property. From further inspection of the definition of $T_i(x;y)$ it is evident that if $g$ and $h$ are of class $C^{k-1}$ and $y$ is of class $C^k$ for some $k$ on $0<k<n$, then $T_i(x;y)$, considered as a function of $x$, is also of class $C^k$; that is, $T(x;y)$ belongs to the 
same class of arcs as $y$, provided $g$ and $h$ have proper 
differentiability properties. Let a primed variable 
signify differentiation of that variable with respect 
to X. Then it will be noted that the arc T'\x',y), 
which when set equal to zero merely presents the 
differential eq (2) in their usual form, does not pos- 
sess the property of belonging to the same class to 
which y belongs. In fact, even if y is of class (?, 
T^^(x;y) may only belong to class C\ 

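Before describing an iterative procedure for constructing arcs that satisfy (3), one must first introduce the functions

$$\begin{aligned}
\delta g_i(x,y;\eta) &= g_{i y_j}(x,y,y')\,\eta_j + g_{i y_j'}(x,y,y')\,\eta_j' \\
\delta h_i(x,y;\eta) &= h_{i y_j}\,\eta_j + h_{i y_j'}\,\eta_j' \qquad (i,j=1,2,\dots,n)
\end{aligned} \tag{5}$$

and

$$\delta T_i(x,y;\eta) = \int_a^x \Big[ \delta g_i(t,y;\eta) - \int_a^t \delta h_i(s,y;\eta)\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ \delta g_i(t,y;\eta) - \int_a^t \delta h_i(s,y;\eta)\,ds \Big] dt \qquad (i=1,2,\dots,n). \tag{6}$$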



In (5), (6), and elsewhere in this paper, unless otherwise noted, the tensor convention of summing on repeated indices is utilized. The arc $\eta$ is taken to be such that it vanishes at the end points of $a \le x \le b$. The function $\delta T_i(x,y;\eta)$ can be obtained from $T_i(x;y)$ as

$$\delta T_i(x,y;\eta) = \frac{d}{d\alpha}\, T_i(x;y+\alpha\eta)\Big|_{\alpha=0}$$

and is often referred to as the "variation" of $T_i$. It has properties similar to those of $T_i$. For example, $\delta T_i$, considered as a function of $x$, belongs to the same class as the function $\eta$, provided that $g$, $h$, and $y$ have sufficient differentiability. This fact follows from a small bit of reflection on (6), as does the fact that

$$\delta T_i(a,y;\eta) = \delta T_i(b,y;\eta) = 0 \qquad (i=1,2,\dots,n), \tag{7}$$

regardless of the choice of $y$ and $\eta$. It is seen that these properties do not hold for the variation of the differential equations (2) taken in their usual form. 
In the iteration process to be discussed here, it is assumed that the $i$th estimate $y^{(i)}$ of an arc satisfying (3) has been obtained. An "improved estimate" $y^{(i+1)}$ is then found as follows. Determine a "variation" $\eta^{(i)}$ by solving the system

$$T_j(x;y^{(i)}) + \delta T_j(x,y^{(i)};\eta) = 0 \quad (j=1,2,\dots,n), \qquad \eta_j(a)=\eta_j(b)=0. \tag{8}$$

The function $y^{(i+1)}$ is then chosen to be

$$y^{(i+1)} = y^{(i)} + \eta^{(i)}; \tag{9}$$

$y^{(i)}$ is replaced by $y^{(i+1)}$, and the steps just described are repeated. 
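
The linearize-and-correct cycle of (8) and (9) can be tried numerically. What follows is a minimal sketch and not the procedure of this paper: it assumes the scalar special case $y''=h(x,y,y')$ (that is, $g=y'$), and it replaces the integral form (3) by a central finite-difference discretization, which is our own simplification. The names `newton_bvp` and `solve_tridiagonal` and the sample problem are illustrative assumptions. Each pass linearizes about the current estimate, solves for a correction that vanishes at both end points, and adds it to the estimate.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm; lower[0] and upper[-1] are ignored."""
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for k in range(1, n):
        denom = diag[k] - lower[k] * c[k - 1]
        c[k] = upper[k] / denom
        d[k] = (rhs[k] - lower[k] * d[k - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for k in range(n - 2, -1, -1):
        x[k] = d[k] - c[k] * x[k + 1]
    return x


def newton_bvp(h, h_y, h_p, a, b, rho, sigma, n=100, tol=1e-9, max_iter=20):
    """Newton iteration for y'' = h(x, y, y'), y(a) = rho, y(b) = sigma.

    h_y and h_p are the partial derivatives of h with respect to y and y'.
    Returns the grid, the final estimate, and the size of each correction.
    """
    dx = (b - a) / n
    xs = [a + k * dx for k in range(n + 1)]
    # Initial estimate: the straight line joining the two end points.
    y = [rho + (sigma - rho) * (x - a) / (b - a) for x in xs]
    history = []
    for _ in range(max_iter):
        lower, diag, upper, rhs = [], [], [], []
        for k in range(1, n):
            p = (y[k + 1] - y[k - 1]) / (2 * dx)
            residual = (y[k + 1] - 2 * y[k] + y[k - 1]) / dx**2 - h(xs[k], y[k], p)
            hy = h_y(xs[k], y[k], p)
            hp = h_p(xs[k], y[k], p)
            # Row k of the linearized (discrete) operator acting on eta.
            lower.append(1 / dx**2 + hp / (2 * dx))
            diag.append(-2 / dx**2 - hy)
            upper.append(1 / dx**2 - hp / (2 * dx))
            rhs.append(-residual)
        eta = solve_tridiagonal(lower, diag, upper, rhs)
        # Only interior values change, so the boundary conditions hold at each step.
        for k in range(1, n):
            y[k] += eta[k - 1]
        history.append(max(abs(e) for e in eta))
        if history[-1] < tol:
            break
    return xs, y, history


# Sample problem (an assumption for illustration): y'' = 1.5 y^2, y(0)=4, y(1)=1,
# for which y = 4 / (1 + x)^2 is a solution.
xs, y, history = newton_bvp(
    lambda x, y, p: 1.5 * y * y,
    lambda x, y, p: 3.0 * y,
    lambda x, y, p: 0.0,
    0.0, 1.0, 4.0, 1.0,
)
```

The list `history` records the largest correction at each pass, which gives a rough picture of how quickly the estimates settle.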

This method may be justified heuristically by observing that upon expanding $T_i(x;y+\eta)$ by Taylor's theorem one gets

$$T_i(x;y+\eta) = T_i(x;y) + \delta T_i(x,y;\eta) + \cdots.$$

Thus it is reasonable to assume that a good approximation to a solution of

$$T_i(x;y+\eta) = 0 \qquad (i=1,2,\dots,n)$$

for a known function $y$ might be obtained by solving

$$T_i(x;y) + \delta T_i(x,y;\eta) = 0 \qquad (i=1,2,\dots,n) \tag{10}$$

for $\eta$. The analogy between this procedure and Newton's method for a function of a single real variable is clear. Consequently, it will henceforth be referred to as Newton's method. 

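As one sees by setting $g_i(x,y,y')=y_i'$ $(i=1,2,\dots,n)$, the differential equations (2) contain the important class of equations

$$y_i'' = h_i(x,y,y') \qquad (i=1,2,\dots,n) \tag{11}$$

as a special case. By setting

$$g_i(x,y,y') = f_{y_i'}(x,y,y'), \qquad h_i(x,y,y') = f_{y_i}(x,y,y') \qquad (i=1,2,\dots,n), \tag{12}$$

(2) can also be made to assume the form

$$\frac{d}{dx} f_{y_i'} = f_{y_i}, \qquad y_i(a)=\rho_i, \quad y_i(b)=\sigma_i \qquad (i=1,2,\dots,n). \tag{13}$$

These are the Euler equations associated with the integral (1), and their solution is the chief application of Newton's method which we wish to make in this paper. The functions $T_i$ and $\delta T_i$ associated with (13) are, respectively,

$$T_i(x;y) = \int_a^x \Big[ f_{y_i'} - \int_a^t f_{y_i}\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ f_{y_i'} - \int_a^t f_{y_i}\,ds \Big] dt$$

and

$$\delta T_i(x,y;\eta) = \int_a^x \Big[ \omega_{\eta_i'} - \int_a^t \omega_{\eta_i}\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ \omega_{\eta_i'} - \int_a^t \omega_{\eta_i}\,ds \Big] dt,$$

where $\omega_{\eta_i}$ and $\omega_{\eta_i'}$ are derivatives of the quadratic form

$$2\omega(x,y;\eta) = f_{y_i y_j}(x,y,y')\,\eta_i\eta_j + 2 f_{y_i y_j'}\,\eta_i\eta_j' + f_{y_i' y_j'}\,\eta_i'\eta_j' \qquad (i,j=1,2,\dots,n).$$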



A deeper insight into Newton's method as applied to (13) may be acquired by applying Taylor's theorem to the integrand $f(x,y+\eta,y'+\eta')$. The expansion results in

$$f(x,y+\eta,y'+\eta') = f(x,y,y') + f_{y_i}(x,y,y')\,\eta_i + f_{y_i'}(x,y,y')\,\eta_i' + \omega(x,y;\eta) + \cdots.$$

Let us drop the higher order terms and consider the problem of minimizing

$$\int_a^b \big[ f(x,y,y') + f_{y_i}\eta_i + f_{y_i'}\eta_i' + \omega(x,y;\eta) \big]\,dx,$$

where $y$ is a known function and $\eta$ is variable. The Euler equations with respect to $\eta$ of this integral are

$$\frac{d}{dx} f_{y_i'} - f_{y_i} + \frac{d}{dx}\,\omega_{\eta_i'} - \omega_{\eta_i} = 0 \qquad (i=1,2,\dots,n).$$

This last system of differential equations is formally equivalent to (10). Thus Newton's method consists of repeatedly replacing $f(x,y,y')$ with an integrand that is quadratic and minimizing the new integral so obtained. Consequently, if the original integrand is quadratic, Newton's method should yield a solution in a single step independently of the choice of the initial estimate. That this actually occurs is readily verifiable. 
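
As a small worked check of the single-step claim, consider the illustrative integrand $f=\tfrac12(y'^2+y^2)$ with $n=1$. This choice is ours and is not taken from the paper. Because $T$ is then linear in $y$, the linearized equation (10) coincides with the exact one:

```latex
% Illustrative case (an assumption, not from the paper): f = (y'^2 + y^2)/2, n = 1,
% so that f_{y'} = y' and f_y = y.
\begin{align*}
T(x;y) &= \int_a^x \Big[ y'(t) - \int_a^t y(s)\,ds \Big] dt
          - \frac{x-a}{b-a} \int_a^b \Big[ y'(t) - \int_a^t y(s)\,ds \Big] dt, \\
\delta T(x,y;\eta) &= \frac{d}{d\alpha}\, T(x;y+\alpha\eta)\Big|_{\alpha=0}
          = T(x;\eta) \quad \text{(the same expression with $y$ replaced by $\eta$)}, \\
T(x;y) + \delta T(x,y;\eta) &= T(x;y+\eta).
\end{align*}
% Hence (10) reads T(x; y + \eta) = 0 exactly, and y + \eta satisfies
% y'' = y with the prescribed end values, whatever the initial estimate y was.
```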

Finally, we note that although the Euler equations 
often have an equivalent formulation as a system 
of type (11), Newton's method may not necessarily 
be the same for both forms of the problem. This is 
due to the fact that in general $\delta T$ is not invariant 
under a transformation of T to an equivalent form. 
For example, the two differential equations 



d 



i(yyl = Wy^\ 



and 



i+y 



A 



are equivalent for 2/>0. In the first case 



and 






X — ( 



In the second case 



T(x;y)= y{x)-yia)-£ [ JJ (^^) ds'j dt 

-si*)-(«)-/:[/:(H^>']*!' 
and 



.r,..„)=„x)-/; [jX^-'-Y-")"']'" 



-aja Ua V 



y 



y' 



Ads 



dt. 



It is seen that the two variations are not the same. 
Since the convergence theory to be developed below 
will be general enough to include either (11) or (13), 
one will be free to use whichever system of differential 
equations seems to be simpler in any given problem. 

3. Preliminaries 

There are several definitions and known results 
of general analysis to which there will be frequent 
occasion to refer. It will be convenient to have 
these collected in one section. Accordingly, the next 
few pages are devoted to this end. The reader who 
wishes more detail about what is to follow in this 
section is advised to consult a paper of Graves and 
Hildebrandt [1]. The treatment he will find there 
is more general and couched in somewhat different 
terminology. 

Functions defined on a linear subspace $U$ of a Banach space $Y$ into itself are dealt with here. These functions may depend, in addition, upon a parameter chosen from $Y$ or a region of $Y$. When the arguments are indicated, the parameter will always appear first and be denoted by a lower case $y$, perhaps bearing a superscript. It will be set off from the independent variables, which will be denoted by Greek letters $\eta$, $\xi$, $\zeta$, etc., by a semicolon. 

Definition 1. $F(y;\eta)$ is said to be continuous in $y$ at $y^{(0)}$ uniformly relative to $\eta$ if given $\epsilon>0$ there exists $\delta>0$ such that

$$\|F(y;\eta) - F(y^{(0)};\eta)\| \le \epsilon\,\|\eta\| \tag{14}$$

for all $y$ in $(y^{(0)})_\delta$ and every $\eta$ in $U$. Here and elsewhere $(y^{(0)})_\delta$ and similar notations denote the $\delta$-neighborhood of $y^{(0)}$. 

The extension of definition 1 to the case of a function $F(y;\eta,\xi,\dots,\zeta)$ of several independent variables is clear. The only essential change is made in replacing eq (14) by

$$\|F(y;\eta,\xi,\dots,\zeta) - F(y^{(0)};\eta,\xi,\dots,\zeta)\| \le \epsilon\,\|\eta\|\,\|\xi\| \cdots \|\zeta\|.$$

All of the definitions that follow may be generalized to functions of several variables in a similar way. 

Definition 2. Let $Y_0$ be a region of $Y$. $F(y;\eta)$ is said to be uniformly continuous in $y$ on $Y_0$ uniformly with respect to $\eta$ if for every $\epsilon>0$ there exists a $\delta>0$ such that

$$\|F(y^{(1)};\eta) - F(y^{(2)};\eta)\| \le \epsilon\,\|\eta\|$$

for all $y^{(1)}$, $y^{(2)}$ in $Y_0$ that satisfy $\|y^{(1)}-y^{(2)}\|<\delta$ and all $\eta$ in $U$. 

Figures in brackets indicate the literature references at the end of this paper. 

Definition 3. $F(y^{(0)};\eta)$ is said to be bounded at $y^{(0)}$ uniformly with respect to $\eta$ if there exists a constant $M>0$ such that

$$\|F(y^{(0)};\eta)\| \le M\,\|\eta\|$$

for all $\eta \in U$. If some $M>0$ holds for all $y$ belonging to some neighborhood $(y^{(0)})_a$, $F$ is said to be uniformly bounded on $(y^{(0)})_a$ uniformly with respect to $\eta$. 

For the sake of brevity the words "uniformly with respect to $\eta$" that appear in the above definitions, and that should appear in the concluding definition (4) of this section, will henceforth be omitted. The reader, however, should supply them for himself in all of the necessary places. 

The following lemma is an easy consequence of the definitions. 

Lemma 1. If $F(y;\eta)$ to $U$ is continuous in $y$ at $y^{(0)}$ and bounded at $y^{(0)}$, there exists a neighborhood of $y^{(0)}$ such that $F$ is uniformly bounded on this neighborhood. 

The conclusion follows from the inequality

$$\|F(y;\eta)\| \le \|F(y^{(0)};\eta)\| + \|F(y;\eta) - F(y^{(0)};\eta)\|.$$

Definition 4. $F(y;\eta)$ to $U$ is said to be of class $C'$ in $y$ uniformly on $Y_0$ in case there exists a function $dF(y;\eta,\xi)$ on $Y_0UU$ to $U$ having the following properties: 

(1) $dF$ is uniformly continuous in $y$ on $Y_0$. 

(2) $dF$ is linear in $\xi$ and uniformly bounded on $Y_0$. 

(3) the function $R(y^{(1)},y^{(2)};\eta)$ defined by

$$F(y^{(1)};\eta) - F(y^{(2)};\eta) - dF(y^{(2)};\eta,\,y^{(1)}-y^{(2)}) = R(y^{(1)},y^{(2)};\eta)\,\|y^{(1)}-y^{(2)}\|, \quad y^{(1)} \ne y^{(2)}; \qquad R(y^{(1)},y^{(1)};\eta) = \theta \tag{15}$$

is such that given $\epsilon>0$ there exists $\delta>0$ such that

$$\|R(y^{(1)},y^{(2)};\eta)\| \le \epsilon\,\|\eta\|$$

for all $y^{(1)}$, $y^{(2)}$ in $Y_0$ satisfying $\|y^{(1)}-y^{(2)}\|<\delta$ and all $\eta$ in $U$. 

Here and elsewhere $\theta$ refers to the zero of the space $Y$. 

Lemma 2. If $F$ is of class $C'$ in $y$ uniformly on $Y_0$, there exist nonnegative constants $a$ and $M$ such that for every $y^{(1)}$ and $y^{(2)}$ of $Y_0$ for which $\|y^{(1)}-y^{(2)}\|<a$ it is true that

$$\|F(y^{(1)};\eta) - F(y^{(2)};\eta)\| \le M\,\|y^{(1)}-y^{(2)}\|\,\|\eta\|$$

for all $\eta$ in $U$. 

The conclusion of this lemma follows from an application of (2) and (3) of definition 4 to eq (15). 

Finally, we state a lemma that corresponds to 
lemma 16.3 of Graves [1]. 

Lemma 3. Let $F(y;\eta)$ to $U$ be bounded at $y$ and linear in $\eta$ for each $y$ of the region $Y_0$. In addition let $y^{(0)}$ be a point of $Y_0$ and let $F$ have a bounded inverse $F^{-1}(y^{(0)};\eta)$ with respect to $\eta$, and be of class $C'$ in $y$ uniformly on $Y_0$. Then there exists a constant $a$ such that for each $y$ in $(y^{(0)})_a$, $F^{-1}(y;\eta)$ exists and is of class $C'$ in $y$ uniformly on $(y^{(0)})_a$. 

4. A Convergence Theorem 

In section 2 an iterative method for constructing 
solutions of the Euler equations was described. 
However, no mention was made of conditions of 
convergence. In this section we will describe and 
give conditions for the convergence of a similar 
procedure that can be applied to a large class of 
functions. Later the process of section 2 will be 
considered as a special case of that to be given here, 
and it will be shown that under simple assumptions 
the special process satisfies the hypothesis of the 
convergence theorem of this section. 

Let $S(y)$ be a continuous function not identically zero defined on a region $Y_0$ of a Banach space $Y$ to a subspace $U$. The problem of interest is that of obtaining solutions of the equation

$$S(y) = \theta. \tag{16}$$

Toward this end one introduces a linear operator $K(y;\eta)$ on $U$ to $U$, which depends in addition on some parameter taken from $Y$. It is assumed that $K(y;\eta)$ is nonsingular in $\eta$. Let the $i$th estimate $y^{(i)}$ of the solution to (16) be known. The $(i+1)$st estimate is constructed as follows. A function $\eta^{(i)}$ is determined by solving

$$K(y^{(i)};\eta^{(i)}) = -S(y^{(i)}), \tag{17}$$

and the function $y^{(i+1)}$ is chosen to be

$$y^{(i+1)} = y^{(i)} + \eta^{(i)}. \tag{18}$$

The process is then repeated. 
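
The scheme (17), (18) can be illustrated in finite dimensions. The sketch below is an assumption-laden toy, not the setting of this paper: the Banach space is replaced by the plane, $S$ is a pair of polynomial equations chosen for illustration, and $K(y;\eta)$ is taken to be the Jacobian matrix of $S$ at $y$ applied to $\eta$. The names `solve2`, `S`, `K`, and `iterate` are hypothetical.

```python
def solve2(A, r):
    """Solve the 2x2 linear system A * eta = r by Cramer's rule."""
    (a, b), (c, d) = A
    det = a * d - b * c
    return [(d * r[0] - b * r[1]) / det, (a * r[1] - c * r[0]) / det]


def S(y):
    """Illustrative system: y1^2 + y2^2 = 4 and y1 * y2 = 1."""
    return [y[0] ** 2 + y[1] ** 2 - 4.0, y[0] * y[1] - 1.0]


def K(y):
    """Matrix of the linear operator K(y; .), here the Jacobian of S at y."""
    return [[2.0 * y[0], 2.0 * y[1]], [y[1], y[0]]]


def iterate(S, K, y0, tol=1e-13, max_iter=50):
    """Repeat: solve K(y; eta) = -S(y) for eta, then set y <- y + eta."""
    y = list(y0)
    steps = []
    for _ in range(max_iter):
        s = S(y)
        eta = solve2(K(y), [-s[0], -s[1]])      # analogue of (17)
        y = [y[0] + eta[0], y[1] + eta[1]]      # analogue of (18)
        steps.append(max(abs(eta[0]), abs(eta[1])))
        if steps[-1] < tol:
            break
    return y, steps


solution, steps = iterate(S, K, [2.0, 0.5])
```

Here `steps` holds the size of each correction; the conditions introduced next are what guarantee, in the general setting, that such corrections shrink.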

In order to simplify the statement of the convergence theorem, several conditions will now be introduced on the function $S(y)$ and on $K^{-1}(y;\eta)$, the inverse with respect to $\eta$ of $K(y;\eta)$. 

(I). The arc $y^{(0)}$ of $Y_0$, the constant $\gamma>0$, and the operator $K^{-1}(y;\eta)$ are such that there exists a positive constant $M$ for which

$$\|K^{-1}(y^{(1)};\eta) - K^{-1}(y^{(2)};\eta)\| \le M\,\|y^{(1)}-y^{(2)}\|\,\|\eta\| \tag{19}$$

for all $\eta$ in $U$ and all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\gamma$. Furthermore, there exists $B>0$ such that

$$\|K^{-1}(y;\eta)\| \le B\,\|\eta\| \tag{20}$$

for all $y$ in $(y^{(0)})_\gamma$ and all $\eta$ in $U$. 

From (19) it is clear that $K^{-1}$ is continuous in $y$ at $y^{(0)}$. Therefore by lemma 1 one sees that in order for (20) to hold on some neighborhood of $y^{(0)}$, which is also a subneighborhood of $(y^{(0)})_\gamma$, it is sufficient to assume that $K^{-1}(y;\eta)$ is bounded at $y^{(0)}$. The remaining part of (I) is, of course, undisturbed on a subneighborhood of $(y^{(0)})_\gamma$. 

In the next two conditions certain non-negative constants $H$, $L$, and $N$ appear. These are taken to be such that they satisfy

$$BL + MN \le H < 1. \tag{21}$$

(II). For all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\gamma$ the inequality

$$\|K(y^{(1)};y^{(1)}-y^{(2)}) - S(y^{(1)}) + S(y^{(2)})\| \le L\,\|y^{(1)}-y^{(2)}\| \tag{22}$$

is true. 

This condition asserts that $K$ is an approximate differential of $S$ on the neighborhood $(y^{(0)})_\gamma$. 

(III). There exists a constant $C$ such that $0<C\le\gamma$ and

$$\|S(y^{(0)})\| \le (1-H)\,C/B, \tag{23}$$

while

$$\|S(y)\| \le N \tag{24}$$

for all other $y$ in $(y^{(0)})_\gamma$. 

Theorem 1. Let the initial estimate $y^{(0)}$ and the positive constant $\gamma$ be such that (I), (II), (III) are valid. Then the sequence of estimates determined by (18) converges to a unique element $\bar y$ of $(y^{(0)})_\gamma$. The arc $\bar y$ is the unique solution of eq (16) in $(y^{(0)})_\gamma$. 

First we prove that

$$\|y^{(i)}-y^{(i-1)}\| \le H^{i-1}(1-H)\,C, \qquad y^{(i)} \text{ in } (y^{(0)})_\gamma; \qquad i \ge 1. \tag{25}$$

In order to construct the proof by induction consider the inequality

$$\|y^{(1)}-y^{(0)}\| = \|K^{-1}(y^{(0)};-S(y^{(0)}))\| \le B\,\|S(y^{(0)})\| \le (1-H)\,C < \gamma.$$

This last follows from (23) and establishes (25) for $i=1$. Assume that (25) is true for all $i \le m-1$. By this assumption in conjunction with (19), (22), and (24) one may write

$$\begin{aligned}
\|y^{(m)}-y^{(m-1)}\| &= \|y^{(m-1)} + K^{-1}(y^{(m-1)};-S(y^{(m-1)})) - y^{(m-2)} - K^{-1}(y^{(m-2)};-S(y^{(m-2)}))\| \\
&\le \|y^{(m-1)} - y^{(m-2)} - K^{-1}(y^{(m-1)};S(y^{(m-1)})) + K^{-1}(y^{(m-1)};S(y^{(m-2)}))\| \\
&\quad + \|K^{-1}(y^{(m-2)};S(y^{(m-2)})) - K^{-1}(y^{(m-1)};S(y^{(m-2)}))\| \\
&\le \|K^{-1}(y^{(m-1)};\,K(y^{(m-1)};y^{(m-1)}-y^{(m-2)}) - S(y^{(m-1)}) + S(y^{(m-2)}))\| \\
&\quad + M\,\|y^{(m-1)}-y^{(m-2)}\| \cdot \|S(y^{(m-2)})\| \\
&\le B\,\|K(y^{(m-1)};y^{(m-1)}-y^{(m-2)}) - S(y^{(m-1)}) + S(y^{(m-2)})\| + M\,\|y^{(m-1)}-y^{(m-2)}\|\,\|S(y^{(m-2)})\| \\
&\le (B \cdot L + M \cdot N)\,\|y^{(m-1)}-y^{(m-2)}\| \\
&\le H \cdot H^{m-2}(1-H)\,C.
\end{aligned}$$

This establishes the first assertion of (25) for $i=m$. The rest of the proof of (25) follows from the inequality

$$\|y^{(m)}-y^{(0)}\| = \Big\| \sum_{i=1}^{m} (y^{(i)}-y^{(i-1)}) \Big\| \le \sum_{i=1}^{m} \|y^{(i)}-y^{(i-1)}\|. \tag{26}$$



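If a sequence $\{y^{(i)}\}$ in a Banach space is such that the series $\sum \|y^{(i)}-y^{(i-1)}\|$ is convergent, it has a unique limit. Therefore, by (25) there exists a unique element $\bar y$ of $Y$ that satisfies

$$\lim_{i\to\infty} \|y^{(i)}-\bar y\| = 0. \tag{27}$$

It follows from (26) and (27) that $\bar y$ must belong to $(y^{(0)})_\gamma$. Upon passing to the limit in the expression

$$y^{(i)} - y^{(i-1)} = K^{-1}(y^{(i-1)};-S(y^{(i-1)}))$$

and recalling the fact that $K^{-1}$ and $S$ are continuous, one obtains

$$K^{-1}(\bar y;S(\bar y)) = \theta.$$

This implies $S(\bar y)=\theta$. 

To see that $\bar y$ is the unique solution of (16) in $(y^{(0)})_\gamma$ let $y^*$ be another element of this neighborhood satisfying (16). Then by (20) and (22)

$$\begin{aligned}
\|\bar y - y^*\| &= \|\bar y - y^* - K^{-1}(\bar y;S(\bar y)) + K^{-1}(\bar y;S(y^*))\| \\
&= \|K^{-1}(\bar y;\,K(\bar y;\bar y-y^*) - S(\bar y) + S(y^*))\| \\
&\le B\,\|K(\bar y;\bar y-y^*) - S(\bar y) + S(y^*)\| \\
&\le BL\,\|\bar y-y^*\|.
\end{aligned}$$

Since (21) implies that $BL<1$, $\bar y=y^*$. 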

It is possible to modify the method of obtaining iterated estimates by always holding $y$ in $K(y;\eta)$ fixed at the initial estimate $y^{(0)}$. This is equivalent to originally introducing an operator $K(\eta)$, which depends only on functions from $U$. In this case, of course, the convergence theorem may be simplified. Condition (I) clearly no longer applies, and condition (III) may be modified so that (24) is eliminated. The resulting theorem may be stated in the following way. 

Theorem 2. Let $K(\eta)$ be a nonsingular linear operator. Let $K^{-1}(\eta)$ be bounded and let the positive constant $L$ be such that $\|K^{-1}\|\,L<1$. Then if the initial estimate $y^{(0)}$ and the positive constant $a$ are such that 

(1) for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_a$

$$\|K(y^{(1)}-y^{(2)}) - S(y^{(1)}) + S(y^{(2)})\| \le L\,\|y^{(1)}-y^{(2)}\|,$$

(2) for some $C$, $0<C\le a$:

$$\|S(y^{(0)})\| \le \frac{(1-\|K^{-1}\| \cdot L)\,C}{\|K^{-1}\|},$$

the sequence of estimates $\{y^{(i)}\}$ will converge to a unique element $\bar y$ of $(y^{(0)})_a$. The arc $\bar y$ will satisfy eq (16) uniquely in $(y^{(0)})_a$. 

Thus when $K$ is independent of $y$, only an initial approximation to the solution at one point and an approximate differential of $S$ are required to insure convergence. Consequently, this last theorem is seen to be just the classical implicit function theorem in general analysis. One notes that in this case the continuity of $S$ is not required. 

This section will be completed by establishing 
conditions under which the sequence of estimates 
will converge quadratically. 

(IV). Let $y^{(0)}$ be an element of $Y$ and let $\gamma$ and $L'$ be constants such that

$$\|K(y^{(1)};y^{(1)}-y^{(2)}) - S(y^{(1)}) + S(y^{(2)})\| \le L'\,\|y^{(1)}-y^{(2)}\|^2 \tag{28}$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\gamma$. 

We may now state the following theorem. 

Theorem 3. Let the initial estimate $y^{(0)}$ and the positive constant $\gamma$ be such that (I), (III), and (IV) are valid. Then the sequence of estimates determined by (18) converges quadratically to a unique element $\bar y$ of some neighborhood $(y^{(0)})_a$, $a\le\gamma$. The arc $\bar y$ is the unique solution of eq (16) in $(y^{(0)})_a$. 

To see that (IV) implies (II) on $(y^{(0)})_a$, $a\le\gamma$, merely take $a$ small enough so that $L'\,\|y^{(1)}-y^{(2)}\| \le L$ for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_a$. As (I) and (III) are undisturbed on the smaller neighborhood, all conclusions but that of quadratic convergence follow at once from theorem 1. The quadratic nature of the convergence can be established by the following calculations. From (17) and (18) it follows that

$$y^{(i)} - y^{(i-1)} = K^{-1}(y^{(i-1)};-S(y^{(i-1)})). \tag{29}$$

Therefore,

$$K(y^{(i-1)};y^{(i)}-y^{(i-1)}) + S(y^{(i-1)}) = \theta.$$

Thus by (28)

$$\|S(y^{(i)})\| \le L'\,\|y^{(i)}-y^{(i-1)}\|^2.$$

Making use of this last result (with $i$ replaced by $i-1$) in (29) together with (20), one obtains

$$\|y^{(i)}-y^{(i-1)}\| \le BL'\,\|y^{(i-1)}-y^{(i-2)}\|^2.$$

This establishes theorem 3. 
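
The difference between the fixed operator of theorem 2 and the updated operator of theorem 3 shows up clearly in a scalar toy problem. The sketch below is an illustration under our own assumptions: $S(y)=y^2-2$, $K(y;\eta)=2y\eta$, and the starting value $1.5$ are chosen only for the demonstration, and the function name `run` is hypothetical.

```python
import math


def S(y):
    """Scalar equation S(y) = 0 whose positive root is sqrt(2)."""
    return y * y - 2.0


def K(y, eta):
    """K(y; eta) = S'(y) * eta, which is linear in eta."""
    return 2.0 * y * eta


def run(y0, steps, freeze):
    """Return the errors |y_i - sqrt(2)| for i = 0, ..., steps.

    With freeze=True the parameter in K is held at the initial estimate,
    as in the modified method; otherwise it is updated at every step.
    """
    root = math.sqrt(2.0)
    y = y0
    errors = [abs(y - root)]
    for _ in range(steps):
        base = y0 if freeze else y
        eta = -S(y) / K(base, 1.0)   # solve K(base; eta) = -S(y) for eta
        y = y + eta
        errors.append(abs(y - root))
    return errors


newton_errors = run(1.5, 4, freeze=False)
frozen_errors = run(1.5, 4, freeze=True)
```

With the updated operator each error is roughly proportional to the square of the previous one, while with the fixed operator the errors shrink by a roughly constant factor per step.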

5. Some Properties of $\delta T(x,y;\eta)$ 

In this section some properties of the function $\delta T$, which was defined in section 2, are developed with an eye toward applying the considerations of section 4 to Newton's method. First, however, the space $Y$ to which the arcs $y$ belong must be more precisely defined. The space $Y$ is chosen to be the collection $(y_i(x))$ of all $n$-tuples of real valued functions that are of class $C'$ on the interval $a \le x \le b$. Addition of elements in $Y$ and multiplication of elements of $Y$ by real scalars are defined in the usual manner. Under these definitions of addition and multiplication $Y$ forms a linear vector space over the real numbers. 

For any $y$ in $Y$ let

$$|y(x)| = \big( y_i(x)\,y_i(x) \big)^{1/2}.$$

Then a topology may be introduced into $Y$ by defining a norm

$$\|y\| = \sup_{a \le x \le b} |y(x)| \ \vee\ \sup_{a \le x \le b} |y'(x)|, \tag{30}$$

where $\vee$ denotes the larger of the two numbers. The fact that (30) satisfies the properties of a norm is easily verified, as is the fact that $\|y^{(1)}-y^{(2)}\|$ has all of the properties of a metric. It is also not difficult to establish that any Cauchy sequence of elements of $Y$ must converge in the sense of the norm to an element of $Y$. In fact, convergence in $Y$ implies uniform convergence in the ordinary sense of the sequence of first derivatives. Thus in the topology specified by (30) $Y$ is a Banach space. 
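
For concreteness, the norm (30) can be approximated on a grid. This is a rough sketch under stated assumptions: $|y(x)|$ is taken to be the Euclidean length of the vector $y(x)$, the supremum is replaced by a maximum over sample points, and the function name `c1_norm` is hypothetical.

```python
import math


def c1_norm(components, derivatives, a, b, samples=2001):
    """Approximate the norm (30) of an arc on a uniform grid over [a, b].

    components and derivatives are lists of callables giving y_i(x) and y_i'(x).
    The supremum is replaced by a maximum over grid points, which is an
    approximation introduced for this sketch.
    """
    sup_y = 0.0
    sup_dy = 0.0
    for k in range(samples):
        x = a + (b - a) * k / (samples - 1)
        sup_y = max(sup_y, math.sqrt(sum(f(x) ** 2 for f in components)))
        sup_dy = max(sup_dy, math.sqrt(sum(f(x) ** 2 for f in derivatives)))
    # The norm is the larger of the two suprema.
    return max(sup_y, sup_dy)


circle_norm = c1_norm([math.sin, math.cos],
                      [math.cos, lambda x: -math.sin(x)], 0.0, math.pi)
line_norm = c1_norm([lambda x: x], [lambda x: 1.0], 0.0, 2.0)
```

Two arcs are close in this norm only when both their values and their slopes are close, which is why convergence in $Y$ controls the first derivatives as well.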

The subclass of arcs in $Y$ that have the property

$$y(a) = y(b) = 0 \tag{31}$$

will be denoted by the symbol $U$. It is clear that $U$ is a linear subspace of $Y$. 

In order to eliminate repetition when discussing certain properties of the norms of elements of $Y$, it is convenient to have the following simple lemma available for reference. 

Lemma 4. Let $y$, $\eta$, and $Q(x,y;\eta)$ belong to $Y$. Let there correspond to each component $Q_i$ of $Q$ a non-negative number $C^{(i)}$ such that

$$\sup_x |Q_i(x,y;\eta)| \le C^{(i)}\,\|\eta\|.$$

Also let there correspond to each component $Q_i'$ of $Q'$, the first derivative with respect to $x$ of $Q$, a non-negative number $C^{(n+i)}$ such that

$$\sup_x |Q_i'(x,y;\eta)| \le C^{(n+i)}\,\|\eta\|.$$

Then there exists $C>0$ such that

$$\|Q(x,y;\eta)\| \le C\,\|\eta\|.$$

By (30) either

$$\|Q(x,y;\eta)\| = \sup_x \Big( \sum_{i=1}^{n} Q_i^2 \Big)^{1/2} \le \sum_{i=1}^{n} \sup_x |Q_i| \le \sum_{i=1}^{n} C^{(i)}\,\|\eta\|,$$

or

$$\|Q(x,y;\eta)\| = \sup_x \Big( \sum_{i=1}^{n} Q_i'^2 \Big)^{1/2} \le \sum_{i=1}^{n} \sup_x |Q_i'| \le \sum_{i=1}^{n} C^{(n+i)}\,\|\eta\|.$$

Obviously $C$ can be chosen to be the greater of the numbers $\sum C^{(i)}$ and $\sum C^{(n+i)}$. 

In addition to defining the space of functions under consideration, it is desirable at this point to state some assumptions that will hold throughout the remainder of the paper. Unless otherwise noted, it is assumed that the functions $g(x,y,y')$ and $h(x,y,y')$ are of class $C^2$ in a region $R$ of Euclidean $(2n+1)$-space. Any arc of $Y$, all of whose elements $(x,y(x),y'(x))$ are in $R$, will be termed admissible. At least one admissible arc is required to exist. This implies the existence of a subregion $Y_0$, all of whose elements are admissible. It is also assumed that the determinant $|g_{i y_j'}(x,y,y')|$ never vanishes in $R$. In the case of the Euler equations, that is, when $g$ and $h$ are given by (12), the above assumptions imply that the derivatives $f_{y_i}$ and $f_{y_i'}$ of the integrand $f(x,y,y')$ must be of class $C^2$ in $R$. 

In eq (6), which defines the function $\delta T_i(x,y;\eta)$, let the parameter $y$ range over $Y_0$ and the variable $\eta$ range over $U$. The following result is then valid. 

Lemma 5. For each $y$ belonging to $Y_0$ the transformation $\delta T(x,y;\eta) = (\delta T_1,\dots,\delta T_n)$ is on $Y_0U$ to $U$ and is linear in $\eta$ and bounded at $y$. 

The fact that $\delta T$ is an element of $U$ follows from the considerations of section 2. The linearity in $\eta$ is a consequence of the corresponding property of eq (5) and of the operation of integration. 

To simplify the proof of the boundedness we introduce a lemma that will be found useful in what follows. 

Lemma 6. Let $q(x;y)$ be given by

$$q(x;y) = a_j(x)\,y_j(x) + b_j(x)\,y_j'(x),$$

where $a_j(x)$, $b_j(x)$ $(j=1,2,\dots,n)$ are continuous on $a \le x \le b$ and $y$ is in $Y$. Then there exists a constant $C>0$ such that for each $y$ in $Y$

$$\sup_x |q(x;y)| \le C\,\|y\| \tag{32}$$

and

$$\sup_x \Big| \int_a^x q(t;y)\,dt \Big| \le (b-a)\,C\,\|y\|. \tag{33}$$

On recalling that $\sup|y_i| \le \|y\|$ and $\sup|y_i'| \le \|y\|$ for all $i=1,2,\dots,n$ one quickly obtains the inequality

$$\sup_x |q(x;y)| \le \Big( \sum_j \sup_x |a_j| \Big) \|y\| + \Big( \sum_j \sup_x |b_j| \Big) \|y\| = C\,\|y\|.$$

The second statement follows from the first by means of the inequality

$$\sup_x \Big| \int_a^x q(t;y)\,dt \Big| \le \int_a^b |q(t;y)|\,dt \le \int_a^b \sup_t |q(t;y)|\,dt.$$

Upon noting that $\delta g_i$ and $\delta h_i$ are of the same type as $q(x;y)$ and applying lemma 6 one obtains the inequality

$$\begin{aligned}
|\delta T_i| \le \sup_x |\delta T_i| &= \sup_x \Big| \int_a^x \Big[ \delta g_i - \int_a^t \delta h_i\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ \delta g_i - \int_a^t \delta h_i\,ds \Big] dt \Big| \\
&\le 2 \int_a^b \Big| \delta g_i - \int_a^t \delta h_i\,ds \Big|\,dt \\
&\le 2 \int_a^b \Big[ \sup_x |\delta g_i| + \sup_t \Big| \int_a^t \delta h_i\,ds \Big| \Big] dt.
\end{aligned}$$

A similar result follows for $|\delta T_i'|$. Thus the conclusion follows from lemma 4. 



6. An Existence Lemma 

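In section 7 the existence of an arc $\eta$ in $U$ that satisfies

$$\int_a^x \Big[ \delta g_i - \int_a^t \delta h_i\,ds \Big] dt - \frac{x-a}{b-a} \int_a^b \Big[ \delta g_i - \int_a^t \delta h_i\,ds \Big] dt = l_i(x) \qquad (i=1,2,\dots,n) \tag{34}$$

for an arbitrary function $l(x)$ in $U$ will be discussed. To facilitate this discussion it is convenient to have available the following lemma. 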

Lemma 7. For each admissible arc $y$ and every $n$-tuple of functions $m_i(x)$ continuous on $a \le x \le b$ the system

$$\delta g_i(x,y;\eta) - \int_a^x \delta h_i(t,y;\eta)\,dt = m_i(x) \qquad (i=1,2,\dots,n) \tag{35}$$

has a unique solution $\eta(x)$ which is of class $C'$ and satisfies $\eta(a)=\theta$. 

An alternate form for (35) to which known methods of proof used in constructing standard existence theorems can be directly applied will be introduced. Let

$$\zeta_i = \delta g_i = g_{i y_j}\,\eta_j + g_{i y_j'}\,\eta_j' \qquad (i=1,2,\dots,n). \tag{36}$$

Because by hypothesis the determinant $|g_{i y_j'}|$ is never zero, eq (36) can be solved for $\eta_j'$. The solution is

$$\eta_i' = W_i(x,\eta,\zeta) \qquad (i=1,2,\dots,n), \tag{37}$$

where $W_i$ is a linear and homogeneous function of $\eta_j$ and $\zeta_j$. By use of (37) $\delta h_i$ may be expressed as

$$\delta h_i = h_{i y_j}\,\eta_j + h_{i y_j'}\,W_j(x,\eta,\zeta),$$

a linear and homogeneous function in $\eta_j$ and $\zeta_j$. Therefore, any pair $\eta$, $\eta'$ satisfying (35) are transformed by (36) into a pair $\eta$, $\zeta$ satisfying

$$\begin{aligned}
\zeta_i - \int_a^x \big[ h_{i y_j}\,\eta_j + h_{i y_j'}\,W_j(t,\eta,\zeta) \big]\,dt &= m_i(x) \qquad (i=1,2,\dots,n) \\
\eta_i - \int_a^x W_i(t,\eta,\zeta)\,dt &= 0 \qquad (i=1,2,\dots,n).
\end{aligned} \tag{38}$$

The last $n$ equations of (38) are obtained by integrating (37) and recalling that $\eta_i(a)=0$. 

Conversely, assume $\eta$, $\zeta$ satisfy (38). The second condition of that set implies (37). Inverting this last system of equations, one obtains (36), and by substitution in the first condition of (38) eq (35) result. Thus instead of seeking a solution of (35) which is of class $C'$ one may equivalently seek a continuous solution of (38). 

The system (38) has the form

$$y_i(x) = \int_a^x A_{ij}(t)\,y_j(t)\,dt + m_i(x) \qquad (i=1,2,\dots,2n), \tag{39}$$

where $A_{ij}$, $m_i$ are all continuous on $a \le x \le b$. From well known methods of proof which have been used by Bliss [2, p. 274-278] and others for systems of differential equations which can be obtained by formally differentiating (39) it can be shown that (39) has a unique continuous solution satisfying $y_i(a)=m_i(a)$. Since

$$y_{n+i}(a) = \eta_i(a) = 0 \qquad (i=1,2,\dots,n)$$

the lemma is proved. 

7. Existence of an Inverse of $\delta T(x,y;\eta)$ 

In this section the existence of a linear bounded inverse in $\eta$ for the function $\delta T(x,y;\eta)$ is discussed. An idea that plays a leading role in this discussion is that of conjugate points. 

Definition 5. A point $P_3$ is said to be conjugate to the point $P_1$ on an arc joining $P_1$ and $P_2$ if there exists a solution of

$$\frac{d}{dx}\,\delta g_i(x,y;\eta) - \delta h_i(x,y;\eta) = 0 \qquad (i=1,2,\dots,n) \tag{40}$$

along this arc, which vanishes at $P_1$ and $P_3$ but is not identically zero between $P_1$ and $P_3$. 

A consequence of definition 5 that will be apropos 
to our present purpose is stated in the following 
lemma. 

.,?^) he solutions oj 

(41) 



Lemma 8. Let rj^^'^ {i,j=l,2, . . 

LiM^hi— I ^hidt=8ij, 
{i=l,2,.,.,n). 



Then x=b is conjugate to x=a if, and only ij, the de- 
terminant \rjt'\b)\ is zero. 

By lemma 7 the $\eta_i^{(j)}(x)$ described in the hypothesis
exist and are of class C'. In addition to satisfying
(41), the $\eta^{(j)}$ are n linearly independent solutions
of (40). Thus the n-parameter family of solutions
of (40) which vanish at $x=a$ is given by

$$\eta_i(x,C_1,C_2,\ldots,C_n)=\eta_i^{(j)}(x)\,C_j\qquad(i,j=1,2,\ldots,n).\tag{42}$$

Setting $\eta_i(b)=0$ in (42), one obtains the system of
linear algebraic equations

$$\eta_i^{(j)}(b)\,C_j=0\qquad(i=1,2,\ldots,n).\tag{43}$$

In order for $x=b$ to be conjugate to $x=a$, the system
(43) must have a nontrivial solution. This is possible
if, and only if, $|\eta_i^{(j)}(b)|=0$.
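The determinant test of lemma 8 can be tried numerically. The following is a minimal sketch under an assumed example that is not taken from the paper: n = 1 and the integrand f = (y'^2 - y^2)/2, so that g = y', h = -y, and (40) becomes eta'' + eta = 0. The solution of (41) that vanishes at a = 0 then starts with eta'(0) = 1, and the "determinant" is simply eta(b). The helper names `rk4`, `accessory` and `eta_at` are hypothetical.

```python
import math

def rk4(f, t0, y0, t1, steps):
    """Integrate y' = f(t, y) from t0 to t1 with the classical RK4 scheme."""
    h = (t1 - t0) / steps
    t, y = t0, list(y0)
    for _ in range(steps):
        k1 = f(t, y)
        k2 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k1)])
        k3 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k2)])
        k4 = f(t + h, [yi + h * ki for yi, ki in zip(y, k3)])
        y = [yi + h / 6 * (p + 2 * q + 2 * r + s)
             for yi, p, q, r, s in zip(y, k1, k2, k3, k4)]
        t += h
    return y

def accessory(t, state):
    # Assumed example: state = (eta, eta'), accessory equation eta'' + eta = 0.
    eta, deta = state
    return [deta, -eta]

def eta_at(b, steps=2000):
    # eta(0) = 0 and eta'(0) = 1 is the n = 1 normalization of (41).
    return rk4(accessory, 0.0, [0.0, 1.0], b, steps)[0]

# For n = 1 the determinant |eta^{(j)}(b)| is just eta(b).
for b in (1.0, 2.0, math.pi):
    print(b, eta_at(b))
```

In this assumed example eta(b) agrees with sin b, so the value printed for b = pi is numerically zero and x = pi is conjugate to x = 0, while b = 1 and b = 2 are not conjugate.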

Lemma 9. For each admissible y in Y whose end
points are not conjugate $\delta T(x,y;\eta)$ is a 1-1 mapping
of U onto itself.

By lemma 5, $\delta T(x,y;\eta)$ is an element of U for each $\eta$
in U and any admissible function y. It remains to
be demonstrated that the system (34) has a unique
solution $\eta$ in U for each $l(x)$ in U provided that the
end points of y are not conjugate. By lemma 7, there
exists a unique $\bar\eta(x)$ of class C' which satisfies

$$\delta g_i-\int_a^x\delta h_i\,dt=l_i'(x),\qquad\bar\eta_i(a)=0\qquad(i=1,2,\ldots,n).$$

Let $\eta_i^{(j)}$ be the functions mentioned in lemma 8.
Then by lemma 8 there always exists a unique set
$[C_i]$ $(i=1,2,\ldots,n)$ such that

$$\bar\eta_i(b)+\eta_i^{(j)}(b)\,C_j=0\qquad(i=1,2,\ldots,n).$$

Consequently, by (41),

$$\eta_i(x)=\bar\eta_i(x)+\eta_i^{(j)}(x)\,C_j\qquad(i=1,2,\ldots,n)\tag{44}$$

is a solution of

$$\delta g_i-\int_a^x\delta h_i\,dt=l_i'+C_i\qquad(i=1,2,\ldots,n)\tag{45}$$

which vanishes at $x=a$ and $x=b$. That $\eta(x)$ is of
class C' is apparent. That $\eta(x)$ is unique follows
from the condition of nonconjugate end points. For
if $\eta^{(1)}$ and $\eta^{(2)}$ both satisfy (45) and the boundary
conditions, their difference must satisfy (40) and
vanish at both end points of $a\le x\le b$, that is, their
difference must be identically zero.

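By integration of (45) it follows that $\eta(x)$ must
satisfy

$$\int_a^x\Big[\delta g_i-\int_a^t\delta h_i\,ds\Big]dt-C_i(x-a)=l_i(x)\qquad(i=1,2,\ldots,n).\tag{46}$$

Letting $x=b$ in (46) one sees that

$$\int_a^b\Big[\delta g_i-\int_a^t\delta h_i\,ds\Big]dt-C_i(b-a)=l_i(b)=0\qquad(i=1,2,\ldots,n).$$

Therefore,

$$C_i=\frac{1}{b-a}\int_a^b\Big[\delta g_i-\int_a^t\delta h_i\,ds\Big]dt\qquad(i=1,2,\ldots,n).$$

Consequently, $\eta(x)$ satisfies (34).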
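The construction used in this proof (a particular solution vanishing at x = a, plus a multiple of the solutions of lemma 8 chosen so that the sum also vanishes at x = b) can be sketched numerically. The example below is an assumption made for illustration only: n = 1, the accessory equation eta'' + eta = 0 from the integrand f = (y'^2 - y^2)/2, the interval [0, 1], and l(x) = (x^2 - x)/2, so that l'(0) = -1/2 and the differentiated form of (45) is eta'' + eta = 1. The function names are hypothetical.

```python
import math

def rk4(f, y0, x_end, steps=2000):
    """Classical RK4 for y' = f(t, y) on [0, x_end]; returns y(x_end)."""
    h = x_end / steps
    t, y = 0.0, list(y0)
    for _ in range(steps):
        k1 = f(t, y)
        k2 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k1)])
        k3 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k2)])
        k4 = f(t + h, [yi + h * ki for yi, ki in zip(y, k3)])
        y = [yi + h / 6 * (p + 2 * q + 2 * r + s)
             for yi, p, q, r, s in zip(y, k1, k2, k3, k4)]
        t += h
    return y

def forced(t, s):       # eta'' + eta = 1, the particular solution eta_bar
    return [s[1], 1.0 - s[0]]

def homogeneous(t, s):  # eta'' + eta = 0, the solution eta^{(1)} of lemma 8
    return [s[1], -s[0]]

def eta_bar(x):
    # eta_bar(0) = 0 and eta_bar'(0) = l'(0) = -1/2 for the assumed l(x)
    return rk4(forced, [0.0, -0.5], x)[0]

def eta_1(x):
    # eta^{(1)}(0) = 0 and eta^{(1)}'(0) = 1, the n = 1 case of (41)
    return rk4(homogeneous, [0.0, 1.0], x)[0]

b = 1.0
C = -eta_bar(b) / eta_1(b)   # solvable because eta_1(b) != 0 (nonconjugate ends)

def eta(x):
    """Superposition (44): vanishes at both end points."""
    return eta_bar(x) + C * eta_1(x)

def exact(x):
    # Closed form of eta'' + eta = 1 with eta(0) = eta(1) = 0
    return 1.0 - math.cos(x) - (1.0 - math.cos(1.0)) / math.sin(1.0) * math.sin(x)
```

The division by `eta_1(b)` is where the nonconjugacy hypothesis enters: if b were conjugate to 0, this quantity would vanish and the constant C could not be determined.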

We conclude with a lemma that states the result 
we have been pointing at in this and the preceding 
section. 

Lemma 10. For each admissible y in Y whose end
points are not conjugate the inverse of $\delta T(x,y;\eta)$ exists
and is linear and bounded at y.

By lemma 5 and lemma 9, $\delta T(x,y;\eta)$ is a bounded
linear transformation mapping a Banach space in a
1-1 manner onto itself. As is shown in Hille [3,
p. 28-29], a transformation having the above properties
must possess a bounded inverse.

The inverse of a linear transformation is linear. 

8. Differentiability Properties of $\delta T(x,y;\eta)$

In this section two lemmas that will be useful in
establishing the fact that $\delta T$ is one of the class of
operators that was denoted by $K(y;\eta)$ in section 4
are proved. The first lemma will help to show that
$\delta T$ satisfies condition (I) of section 4, whereas the
second relates to conditions (II) and (IV) of the
same section. First, however, we present a lemma
that will considerably simplify the statement of the
proofs of our two principal ones.

Lemma 11. Let $p(x,y,y',\eta,\xi)$ be of the form

$$p(x,y,y',\eta,\xi)=a_{ij}(x,y,y')\,\eta_i\xi_j+b_{ij}(x,y,y')\,\eta_i\xi_j'+\cdots\tag{47}$$

where $a_{ij}$, $b_{ij}$, $c_{ij}$, are all continuous functions on the
same region R on which g and h are defined, y is an
admissible element of Y, and $\eta$, $\xi$ are elements of U.
Let $y^{(0)}$ be any admissible element of Y and $\delta>0$ be
such that the closure of $(y^{(0)})_\delta$ contains only admissible
arcs. Then given $\epsilon>0$ it follows that there exists $d>0$
such that

$$|p(x,y^{(1)},y^{(1)\prime},\eta,\xi)-p(x,y^{(2)},y^{(2)\prime},\eta,\xi)|<\epsilon\,\|\eta\|\cdot\|\xi\|\tag{48}$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\delta$ which satisfy $\|y^{(1)}-y^{(2)}\|<d$
and all $\eta$, $\xi$ in U. Furthermore, there exists a constant
$C>0$ such that

$$\sup_x|p(x,y,y',\eta,\xi)|<C\,\|\eta\|\cdot\|\xi\|,\tag{49}$$

and

$$\sup_x\int_a^x|p(t,y,y',\eta,\xi)|\,dt<(b-a)\,C\,\|\eta\|\cdot\|\xi\|\tag{50}$$

for y in the closure of $(y^{(0)})_\delta$, and $\eta$, $\xi$ in U.

Statements (49) and (50) follow from a simple 
extension of lemma 6. 

In order to establish the remaining part of the
lemma, let

$$\varphi(x,y^{(1)},y^{(2)},\eta,\xi)=p(x,y^{(1)},y^{(1)\prime},\eta,\xi)-p(x,y^{(2)},y^{(2)\prime},\eta,\xi).$$

Then $\varphi$ is a continuous function on the closed and
bounded set S: $a\le x\le b$, $y^{(1)}$, $y^{(2)}$ in the closure of
$(y^{(0)})_\delta$, $\|\eta\|=\|\xi\|=1$. That it is sufficient to
consider $\eta$, $\xi$ on the unit sphere follows from the
bilinearity of p. It is clear that $\varphi=0$ on the subset
of S for which $y^{(1)}=y^{(2)}$. Thus $|\varphi|<\epsilon$ on a neighborhood
of this set, that is, on $a\le x\le b$, $y^{(1)}$, $y^{(2)}$ in
$(y^{(0)})_\delta$, $\|y^{(1)}-y^{(2)}\|<d$, $\|\eta\|=\|\xi\|=1$ for d sufficiently
small. This proves lemma 11.

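Lemma 12. For each admissible $y^{(0)}$ in Y there
exists a constant $\delta>0$ such that $\delta T(x,y;\eta)$ is of class C'
in y uniformly on $(y^{(0)})_\delta$.

Let $\delta^2g_i(x,y;\eta,\xi)$ and $\delta^2h_i(x,y;\eta,\xi)$ be defined by

$$\delta^2g_i=g_{iy_jy_k}(x,y,y')\,\eta_j\xi_k+g_{iy_jy_k'}(x,y,y')\,\eta_j\xi_k'+g_{iy_j'y_k}\,\eta_j'\xi_k+g_{iy_j'y_k'}\,\eta_j'\xi_k',\tag{51}$$

and

$$\delta^2h_i=h_{iy_jy_k}(x,y,y')\,\eta_j\xi_k+h_{iy_jy_k'}\,\eta_j\xi_k'+h_{iy_j'y_k}\,\eta_j'\xi_k+h_{iy_j'y_k'}\,\eta_j'\xi_k',\tag{52}$$

where y is some admissible element of Y and $\eta$, $\xi$ are
arcs in U. Then the function $d(\delta T)$ mentioned in
definition 4 can be defined as

$$d(\delta T_i)\equiv\delta^2T_i(x,y;\eta,\xi)=\int_a^x\Big[\delta^2g_i-\int_a^t\delta^2h_i\,ds\Big]dt-\frac{x-a}{b-a}\int_a^b\Big[\delta^2g_i-\int_a^t\delta^2h_i\,ds\Big]dt\qquad(i=1,2,\ldots,n).$$

By reasoning similar to that used in considering $\delta T$
it follows that $\delta^2T$ is on YUU to U. That $\delta^2T$ is
bilinear in $\eta$ and $\xi$ is obvious. To complete the
proof of the lemma conditions (1), (2), and (3) of
definition 4 must be established.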

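(1) For every admissible $y^{(0)}$ in Y and every finite
$\delta>0$ such that the closure of $(y^{(0)})_\delta$ contains only
admissible arcs $\delta^2T(x,y;\eta,\xi)$ is continuous in y
uniformly on $(y^{(0)})_\delta$.

Let $F(x,y,y';\eta,\xi)$ be any function defined on Y and
U. Then we define

$$\Delta F(1,2)\equiv F(x,y^{(1)},y^{(1)\prime};\eta,\xi)-F(x,y^{(2)},y^{(2)\prime};\eta,\xi).$$

Keeping this convention in mind, one may write

$$\sup_x\left|\delta^2T_i(x,y^{(1)};\eta,\xi)-\delta^2T_i(x,y^{(2)};\eta,\xi)\right|$$
$$=\sup_x\left|\int_a^x\Big[\Delta\delta^2g_i(1,2)-\int_a^t\Delta\delta^2h_i(1,2)\,ds\Big]dt-\frac{x-a}{b-a}\int_a^b\Big[\Delta\delta^2g_i(1,2)-\int_a^t\Delta\delta^2h_i(1,2)\,ds\Big]dt\right|\tag{53}$$
$$\le2\int_a^b\left|\Delta\delta^2g_i(1,2)-\int_a^t\Delta\delta^2h_i(1,2)\,ds\right|dt$$
$$\le2\int_a^b|\Delta\delta^2g_i(1,2)|\,dt+2(b-a)\int_a^b|\Delta\delta^2h_i(1,2)|\,dt.$$

Examination of (51) and (52) reveals that both $\delta^2g_i$
and $\delta^2h_i$ are of the type $p(x,y,y',\eta,\xi)$ defined in
lemma 11. Thus application of lemma 11 to (53)
yields the conclusion

$$\sup_x\left|\delta^2T_i(x,y^{(1)};\eta,\xi)-\delta^2T_i(x,y^{(2)};\eta,\xi)\right|<\epsilon\,\|\eta\|\cdot\|\xi\|$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\delta$ which satisfy a condition of
the form $\|y^{(1)}-y^{(2)}\|<d$. The same result can be
established when $\delta^2T_i$ is replaced by $\delta^2T_i'$. Therefore,
(1) follows at once from lemma 4.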

(2) For every admissible $y^{(0)}$ in Y there exists a
$\delta>0$ such that $\delta^2T(x,y;\eta,\xi)$ is uniformly bounded on
$(y^{(0)})_\delta$.

From an application of (49) and (50) of lemma 11
the inequality

$$\sup_x|\delta^2T_i(x,y^{(0)};\eta,\xi)|\le2\int_a^b\left|\delta^2g_i-\int_a^t\delta^2h_i\,ds\right|dt$$
$$\le2\int_a^b\Big[\sup_t|\delta^2g_i|+\sup_t\Big|\int_a^t\delta^2h_i\,ds\Big|\Big]dt<C^{(1)}\,\|\eta\|\cdot\|\xi\|,\qquad C^{(1)}>0\tag{54}$$

is obtained. From analogous considerations the
existence of $C^{(2)}>0$ such that

$$\sup_x|\delta^2T_i'(x,y^{(0)};\eta,\xi)|<C^{(2)}\,\|\eta\|\cdot\|\xi\|\tag{55}$$

can be demonstrated. Because of lemma 4, inequalities
(54) and (55) imply that $\delta^2T$ is bounded
in y at $y^{(0)}$. By (1) $\delta^2T$ is continuous in y at $y^{(0)}$.
Thus by lemma 1 there exists $\delta>0$ such that $\delta^2T$ is
uniformly bounded in y on $(y^{(0)})_\delta$.

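(3) For each admissible $y^{(0)}$ in Y and each finite
$\delta>0$ which is such that the closure of $(y^{(0)})_\delta$ contains
only admissible arcs, the function $R(y^{(1)},y^{(2)};\eta)$ defined
by replacing F by $\delta T$ and dF by $\delta^2T$ in (4) is
such that given $\epsilon>0$ there exists $d>0$ for which

$$\|R(y^{(1)},y^{(2)};\eta)\|<\epsilon\,\|\eta\|$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\delta$ satisfying $\|y^{(1)}-y^{(2)}\|<d$
and all $\eta$ in U.

Consider

$$\delta T_i(x,y^{(1)};\eta)-\delta T_i(x,y^{(2)};\eta)=\int_a^x\Big[\Delta g_{iy_j}(1,2)\,\eta_j+\Delta g_{iy_j'}(1,2)\,\eta_j'-\int_a^t\big(\Delta h_{iy_j}(1,2)\,\eta_j+\Delta h_{iy_j'}(1,2)\,\eta_j'\big)ds\Big]dt$$
$$-\frac{x-a}{b-a}\int_a^b\Big[\Delta g_{iy_j}(1,2)\,\eta_j+\Delta g_{iy_j'}(1,2)\,\eta_j'-\int_a^t\big(\Delta h_{iy_j}(1,2)\,\eta_j+\Delta h_{iy_j'}(1,2)\,\eta_j'\big)ds\Big]dt.\tag{56}$$

Applying the law of the mean to a typical $\Delta$-symbol
in (56), one obtains

$$\Delta g_{iy_j}(1,2)=g_{iy_jy_k}(x,y^{(3)},y^{(3)\prime})\,(y_k^{(1)}-y_k^{(2)})+g_{iy_jy_k'}(x,y^{(3)},y^{(3)\prime})\,(y_k^{(1)\prime}-y_k^{(2)\prime}),$$

where

$$y^{(3)}=y^{(2)}+\mu\,(y^{(1)}-y^{(2)}),\qquad0<\mu<1.$$

Thus by applying the law of the mean to the remaining
$\Delta$-symbols and taking note of (51) and (52), we
may write

$$\delta T_i(x,y^{(1)};\eta)-\delta T_i(x,y^{(2)};\eta)-\delta^2T_i(x,y^{(1)};\eta,y^{(1)}-y^{(2)})=\int_a^xD\,dt-\frac{x-a}{b-a}\int_a^bD\,dt,$$

where

$$D=\Delta g_{iy_jy_k}(3,1)\,\eta_j(y_k^{(1)}-y_k^{(2)})+\Delta g_{iy_jy_k'}(3,1)\,\eta_j(y_k^{(1)\prime}-y_k^{(2)\prime})+\Delta g_{iy_j'y_k}(3,1)\,\eta_j'(y_k^{(1)}-y_k^{(2)})+\Delta g_{iy_j'y_k'}(3,1)\,\eta_j'(y_k^{(1)\prime}-y_k^{(2)\prime})$$
$$-\int_a^t\Big[\Delta h_{iy_jy_k}(3,1)\,\eta_j(y_k^{(1)}-y_k^{(2)})+\Delta h_{iy_jy_k'}(3,1)\,\eta_j(y_k^{(1)\prime}-y_k^{(2)\prime})+\Delta h_{iy_j'y_k}(3,1)\,\eta_j'(y_k^{(1)}-y_k^{(2)})+\Delta h_{iy_j'y_k'}(3,1)\,\eta_j'(y_k^{(1)\prime}-y_k^{(2)\prime})\Big]ds.\tag{57}$$

Of course, the 3 appearing in the arguments of the
various $\Delta$-symbols does not always refer to the same
value of $\mu$.

The function $g_{iy_jy_k}\eta_j(y_k^{(1)}-y_k^{(2)})$, which is typical of
the functions to which the $\Delta$ operation is applied in
(57), is of the type considered in lemma 11. Thus
it readily follows from that lemma that

$$\sup_x\left|\delta T_i(x,y^{(1)};\eta)-\delta T_i(x,y^{(2)};\eta)-\delta^2T_i(x,y^{(1)};\eta,y^{(1)}-y^{(2)})\right|\le2\int_a^b|D|\,dt<\epsilon\,\|\eta\|\cdot\|y^{(1)}-y^{(2)}\|$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\delta$ such that $\|y^{(1)}-y^{(2)}\|<d$. A
similar result holds if $\delta T_i$ and $\delta^2T_i$ are replaced by
their derivatives. Therefore, by lemma 4 it is true
that

$$\|\delta T(x,y^{(1)};\eta)-\delta T(x,y^{(2)};\eta)-\delta^2T(x,y^{(1)};\eta,y^{(1)}-y^{(2)})\|<\epsilon\,\|\eta\|\cdot\|y^{(1)}-y^{(2)}\|\tag{58}$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\delta$ that are close enough together.
Statement (3) follows at once from (58).

By choosing the $\delta$'s of (1) and (3) to be equal to
each other and less than the $\delta$ mentioned in (2),
one obtains a $\delta$ that is suitable to be used in lemma
12.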

Lemma 13. For any admissible $y^{(0)}$ in Y there
exist constants $\alpha>0$, $L>0$ such that

$$\|\delta T(x,y^{(1)};y^{(1)}-y^{(2)})-T(x;y^{(1)})+T(x;y^{(2)})\|\le L\,\|y^{(1)}-y^{(2)}\|^2$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\alpha$.

Consider

$$T_i(x;y^{(2)})-T_i(x;y^{(1)})=\int_a^x\Big[\Delta g_i(2,1)-\int_a^t\Delta h_i(2,1)\,ds\Big]dt-\frac{x-a}{b-a}\int_a^b\Big[\Delta g_i(2,1)-\int_a^t\Delta h_i(2,1)\,ds\Big]dt.$$

By the law of the mean

$$\Delta g_i(2,1)=g_{iy_j}(x,y^{(3)},y^{(3)\prime})\,(y_j^{(2)}-y_j^{(1)})+g_{iy_j'}(x,y^{(3)},y^{(3)\prime})\,(y_j^{(2)\prime}-y_j^{(1)\prime})$$

where

$$y^{(3)}=y^{(1)}+\mu\,(y^{(2)}-y^{(1)}),\qquad0<\mu<1.$$

As a similar result holds for $\Delta h_i$, we may write

$$\delta T_i(x,y^{(1)};y^{(1)}-y^{(2)})-T_i(x;y^{(1)})+T_i(x;y^{(2)})=\int_a^xZ\,dt-\frac{x-a}{b-a}\int_a^bZ\,dt,$$

where

$$Z=\Delta g_{iy_j}(1,3)\,(y_j^{(1)}-y_j^{(2)})+\Delta g_{iy_j'}(1,3)\,(y_j^{(1)\prime}-y_j^{(2)\prime})-\int_a^t\Big[\Delta h_{iy_j}(1,3)\,(y_j^{(1)}-y_j^{(2)})+\Delta h_{iy_j'}(1,3)\,(y_j^{(1)\prime}-y_j^{(2)\prime})\Big]ds.$$

Of course, the 3 appearing in the arguments of $\Delta g_i$
and $\Delta h_i$ does not always refer to the same value of $\mu$.
Application of the law of the mean to a typical term
of Z yields

$$\Delta g_{iy_j'}(1,3)=g_{iy_j'y_k}(x,y^{(4)},y^{(4)\prime})\,(y_k^{(1)}-y_k^{(2)})\,\mu+g_{iy_j'y_k'}(x,y^{(4)},y^{(4)\prime})\,(y_k^{(1)\prime}-y_k^{(2)\prime})\,\mu,$$

where

$$y^{(4)}=y^{(1)}+\mu^*(y^{(3)}-y^{(1)}),\qquad0<\mu^*<1.$$

It will be noted that $\Delta g_{iy_j'}(1,3)$ is of the same form
as the functions considered in lemma 11. Let $\alpha$ be
chosen small enough so that for any $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_\alpha$,
$y^{(4)}$ lies in some fixed neighborhood of $y^{(0)}$ whose
closure contains only admissible functions. Then
because of our previous hypothesis on the class of
$h_i$ and $g_i$, we may apply lemma 11 from which the
existence of $C^{(3)}>0$ such that

$$\sup_x\left|\delta T_i(x,y^{(1)};y^{(1)}-y^{(2)})-T_i(x;y^{(1)})+T_i(x;y^{(2)})\right|\le C^{(3)}\,\|y^{(1)}-y^{(2)}\|^2$$

follows. As similar considerations are valid when
$\delta T_i$, $T_i$ are replaced by $\delta T_i'$, $T_i'$, respectively, the
conclusion of lemma 13 follows from lemma 4.

9. Convergence of the Sequence of 
Estimates Derived by Newton's Method 

The main theorem on the convergence of the se- 
quence of estimates derived from Newton's method 
is the following. 

Theorem 4. Let $y^{(0)}$ be an admissible element of Y
having nonconjugate end points. Then if there exists a
sufficiently small constant $D>0$ such that $\|T(x;y^{(0)})\|<D$,
the sequence of estimates derived from Newton's method
will converge quadratically to a unique arc y of class C''
which lies within a predetermined neighborhood of $y^{(0)}$
and satisfies $T(x;y)=\theta$, $y(a)=y^{(0)}(a)$, $y(b)=y^{(0)}(b)$.

By lemma 12, there exists a constant $\alpha_1>0$ such
that $\delta T(x,y;\eta)$ is of class C' in y uniformly on $(y^{(0)})_{\alpha_1}$.
By lemma 10, $\delta T^{-1}(x,y^{(0)};\eta)$ exists, and is linear and
bounded at $y^{(0)}$. By lemma 5, $\delta T$ is on $(y^{(0)})_{\alpha_1}U$ to U
and is bounded in y and linear in $\eta$. Therefore, by
lemma 3, there exists $\alpha_2$ such that $\delta T^{-1}$ exists on
$(y^{(0)})_{\alpha_2}$ and is of class C' in y uniformly on $(y^{(0)})_{\alpha_2}$.
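Thus by lemma 2, there exist nonnegative constants
$\alpha_3$ and M such that for $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_{\alpha_3}$ and
$\|y^{(1)}-y^{(2)}\|<\alpha_3$ it follows that

$$\|\delta T^{-1}(x,y^{(1)};\eta)-\delta T^{-1}(x,y^{(2)};\eta)\|\le M\,\|y^{(1)}-y^{(2)}\|\cdot\|\eta\|.$$

Consequently, if we choose $\alpha_4=\alpha_3/2$, a part of condition
(I) of section 4 is satisfied on $(y^{(0)})_{\alpha_4}$.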

By an argument similar to that applied in section
4, it can be shown that there exist nonnegative
constants $\alpha_5$, B such that

$$\|\delta T^{-1}(x,y;\eta)\|\le B\,\|\eta\|\tag{59}$$

for all y in $(y^{(0)})_{\alpha_5}$ and all $\eta$ in U. If $\alpha_6$ is taken to be
less than or equal to inf $(\alpha_4,\alpha_5)$, and such that $(y^{(0)})_{\alpha_6}$
contains only admissible arcs, the part of condition
(I) previously proved and inequality (59) are still valid
on $(y^{(0)})_{\alpha_6}$, that is, condition (I) holds on $(y^{(0)})_{\alpha_6}$.
The existence of nonnegative constants H, L, N,
which along with the nonnegative constants B, M
already determined in this section satisfy relation
(21) is clear. By lemma 13 there also exist $\alpha_7$ and L
such that

$$\|\delta T(x,y^{(1)};y^{(1)}-y^{(2)})-T(x;y^{(1)})+T(x;y^{(2)})\|\le L\,\|y^{(1)}-y^{(2)}\|^2$$

for all $y^{(1)}$, $y^{(2)}$ in $(y^{(0)})_{\alpha_7}$. That is, condition (IV)
of section 4 is valid on $(y^{(0)})_{\alpha_7}$. Thus we may choose
$\alpha_8\le\alpha_7$ such that conditions (I) and (IV) both hold
on $(y^{(0)})_{\alpha_8}$. By continuity there exists $\alpha_9$ such that
$\|T(x;y)\|<D$ on $(y^{(0)})_{\alpha_9}$. Let C be such that
$0<C<$ inf $(\alpha_8,\alpha_9)$. Then if the constant D is less
than or equal to the smaller of N and $(1-H)C/B$,
condition (III) is satisfied on $(y^{(0)})_{\alpha_9}$. Consequently,
if the positive constant $\gamma$ in theorem 3 is
defined to be the inf $(\alpha_8,\alpha_9)$, we may conclude by that
theorem that Newton's method converges quadratically
to a unique element y of $(y^{(0)})_\alpha$, $\alpha<\gamma$, which
is such that (3) is satisfied. That

$$y(a)=y^{(0)}(a),\qquad y(b)=y^{(0)}(b)$$

follows at once from the fact that all the variations
$\eta^{(i)}$ vanish at $x=a$ and $x=b$.

The arc y is of class C' by virtue of being in Y.
To see that y is actually of class C'' consider the
equations in $(x,v)$

$$G_i(x,v)\equiv g_i(x,y(x),v)-\int_a^xh_i(t,y(t),y'(t))\,dt-\frac{1}{b-a}\int_a^b\Big[g_i(t,y(t),y'(t))-\int_a^th_i(s,y(s),y'(s))\,ds\Big]dt=0\qquad(i=1,2,\ldots,n).\tag{60}$$

By what has just been proved the point $(x,y'(x))$ is
a particular solution of (60) for each x on $a\le x\le b$.
Since for each such x, $(x,y(x),y'(x))$ is an interior
point of the region R in which g and h are of class C',
$G(x,v)$ is of the same class in a neighborhood of each
point $(x,y'(x))$. By hypothesis the Jacobian

$$|G_{iv_j}|=|g_{iy_j'}|\ne0$$

in the region R. Thus the implicit function theorem
(see Bliss [2, p. 269]) is applicable. This theorem
tells us that $y'(x)$ is of the same class as $G(x,v)$,
that is, $y(x)$ is of class C''.

The following theorem gives further conditions 
under which the conclusions of theorem 4 will be 
valid. 

Theorem 5. Let $y^*$ be an admissible arc with nonconjugate
end points such that $T(x;y^*)=\theta$. Then there
exists a constant $\alpha>0$ such that for any initial estimate
$y^{(0)}$ taken from $(y^*)_\alpha$ the conclusions of theorem 4 are
valid.

By reasoning similar to that employed in the
proof of theorem 4 there exists $\beta>0$ such that conditions
(I), (II), and (IV) are satisfied on $(y^*)_\beta$.
By continuity $\beta$ may be made to have the additional
property that $\|T(y)\|<N$ for all y in $(y^*)_\beta$. Let $\alpha$
be such that $0<\alpha<\beta/2$. Then for any $y^{(0)}$ belonging
to $(y^*)_\alpha$ it follows that $(y^{(0)})_{\beta/2}$ is contained in $(y^*)_\beta$.
Thus (I), (II), (IV), and the inequality $\|T(y)\|<N$
hold on $(y^{(0)})_{\beta/2}$. Let C be such that $0<C<\beta/2$.
Then by continuity $\alpha$ may be taken so small that for
all $y^{(0)}$ in $(y^*)_\alpha$, $\|T(y^{(0)})\|<(1-H)C/B$. This says condition
(III) also holds on the $(\beta/2)$-neighborhood of
any $y^{(0)}$ taken from $(y^*)_\alpha$. We now invoke theorem 3
and the remainder of the proof follows as in theorem
4.

If one wishes, one may hold $y^{(0)}$ fixed in $\delta T(x,y;\eta)$
throughout the entire process. Theorem 2 is applicable
in this simplified case, and with its help all
of the previous results can be shown to be true for
the simplified Newton's method.
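The iteration discussed in this section can be illustrated on a discretized model problem. The sketch below is not the construction of the paper: it assumes the integrand f = y'^2/2 + e^y on [0, 1] with y(0) = y(1) = 0, whose Euler equation is y'' = e^y, replaces derivatives by central differences, and solves each linearized system with a tridiagonal elimination. The names `solve_tridiagonal` and `newton` and the flag `frozen` (which holds the linearization at the initial estimate, in the spirit of the simplified Newton's method) are hypothetical.

```python
import math

def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm; lower[k] multiplies x[k-1], upper[k] multiplies x[k+1]."""
    n = len(diag)
    c, d = [0.0] * n, [0.0] * n
    c[0], d[0] = upper[0] / diag[0], rhs[0] / diag[0]
    for k in range(1, n):
        m = diag[k] - lower[k] * c[k - 1]
        c[k] = upper[k] / m if k < n - 1 else 0.0
        d[k] = (rhs[k] - lower[k] * d[k - 1]) / m
    x = [0.0] * n
    x[-1] = d[-1]
    for k in range(n - 2, -1, -1):
        x[k] = d[k] - c[k] * x[k + 1]
    return x

def newton(n=50, iters=8, frozen=False):
    """Newton iteration for the discretized equation y'' = exp(y), y(0) = y(1) = 0."""
    h = 1.0 / n
    y = [0.0] * (n + 1)            # initial estimate; it satisfies the end conditions
    frozen_diag = None
    history = []                   # max residual before each correction
    for _ in range(iters):
        r = [(y[k - 1] - 2 * y[k] + y[k + 1]) / h ** 2 - math.exp(y[k])
             for k in range(1, n)]
        history.append(max(abs(v) for v in r))
        diag = [-2 / h ** 2 - math.exp(y[k]) for k in range(1, n)]
        if frozen:                 # simplified variant: keep the first linearization
            if frozen_diag is None:
                frozen_diag = diag
            diag = frozen_diag
        off = [1 / h ** 2] * (n - 1)
        eta = solve_tridiagonal(off, diag, off, r)
        for k in range(1, n):      # the correction vanishes at both end points
            y[k] -= eta[k - 1]
    return y, history

y_full, hist_full = newton()
y_simple, hist_simple = newton(frozen=True)
print(hist_full)
print(hist_simple)
```

Because only interior values are corrected, every estimate satisfies the boundary conditions, as in the paper. Printing the two residual histories lets one compare how quickly the full and the simplified iterations reduce the residual on this assumed problem.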

10. A Lemma Concerning Conjugate Points

It is easily seen that the equation

$$\delta T_i(x,y;\eta)=\int_a^x\Big[\delta g_i(t,y;\eta)-\int_a^t\delta h_i(s,y;\eta)\,ds\Big]dt-\frac{x-a}{b-a}\int_a^b\Big[\delta g_i(t,y;\eta)-\int_a^t\delta h_i(s,y;\eta)\,ds\Big]dt=0\qquad(i=1,2,\ldots,n)\tag{61}$$

and eq (40) of definition 5 are equivalent. Thus in
the considerations of this section it will be assumed
that (40) has been replaced by (61) in the definition
of conjugate points.

If y is an arc such that $\delta T^{-1}(x,y;\eta)$, the inverse
with respect to $\eta$ of $\delta T(x,y;\eta)$, exists for all $\eta$ in U,
then y has nonconjugate end points, because the
unique element of U which $\delta T(x,y;\eta)$ maps into $\theta$ is $\theta$
itself. In lemma 9 it was shown that if y has nonconjugate
end points, $\delta T^{-1}(x,y;\eta)$ exists. Thus a
necessary and sufficient condition for $\delta T^{-1}(x,y;\eta)$ to
exist for all $\eta$ in U is that y have nonconjugate end
points. However, by lemma 3, if $\delta T^{-1}$ exists at $y^{(0)}$
it must exist in some neighborhood of $y^{(0)}$. Therefore,
the following lemma may be looked upon as a
corollary of lemma 3.

Lemma 14. Let $y^{(0)}$ be an admissible element of Y
having nonconjugate end points. Then there exists
$\delta>0$ such that no elements of $(y^{(0)})_\delta$ have conjugate end
points.

In this section we wish to give an alternate proof 
of lemma 14, which depends directly on the proper- 
ties of functions with Lebesgue square integrable 
derivatives rather than on the Banach space theory 
of section 3. 

To construct a proof of lemma 14 by contradiction,
assume that there exists no neighborhood of $y^{(0)}$ in
which all elements have nonconjugate end points.
Then in every neighborhood of $y^{(0)}$ there exists at least
one function y such that (61) has a nontrivial solution
along y which vanishes at $x=a$ and $x=b$. Consequently,
from consideration of the $1/q$-neighborhoods of $y^{(0)}$
for $q=1,2,\ldots$ one obtains the sequences $\{y^{(q)}\}$,
$\{\eta^{(q)}\}$, which are such that

$$y^{(q)}(x)\to y^{(0)}(x)\ \text{uniformly on }a\le x\le b,\qquad y^{(q)\prime}(x)\to y^{(0)\prime}(x)\ \text{uniformly on }a\le x\le b\qquad(q=1,2,\ldots)\tag{62}$$

$$\delta T(x,y^{(q)};\eta^{(q)})=\theta,\qquad\eta^{(q)}(a)=\eta^{(q)}(b)=\theta\qquad(q=1,2,\ldots).\tag{63}$$

Since all arcs $\eta$ under consideration belong to the
space U they must certainly belong to the class of the
totality of arcs in $(x,\eta_1,\ldots,\eta_n)$-space defined by
a set of n real valued functions

$$\eta_i(x)\qquad(a\le x\le b;\ i=1,2,\ldots,n)$$

that are absolutely continuous and have integrable
square derivatives $\eta_i'(x)$ on $a\le x\le b$. Henceforth,
we will think of the $\eta$'s as belonging to this class.

Without loss of generality one can choose $\eta^{(q)}$ such
that

$$\int_a^b\eta_i^{(q)\prime}\,\eta_i^{(q)\prime}\,dx=1\qquad(q=1,2,\ldots)\quad\text{(no sum on q)}.\tag{64}$$

Equations (64) imply the existence of a subsequence
$\{\eta^{(q_k)\prime}\}$, which converges weakly in $L_2$ to some element
$\eta^{(0)\prime}$. We assume that this subsequence is the
same as our original sequence. The weak convergence
in $L_2$ of the sequence $\{\eta^{(q)\prime}\}$ implies

$$\eta_i^{(q)}\to\eta_i^{(0)}\ \text{uniformly on }a\le x\le b,\qquad\eta_i^{(q)\prime}\to\eta_i^{(0)\prime}\ \text{weakly in }L_2\qquad(i=1,2,\ldots,n).\tag{65}$$

It is also true that

$$\int_a^b\eta_i^{(0)\prime}\,\eta_i^{(0)\prime}\,dx\le\liminf_{q\to\infty}\int_a^b\eta_i^{(q)\prime}\,\eta_i^{(q)\prime}\,dx\quad\text{(no sum on q)}.\tag{66}$$

A proof of this last fact has been given by McShane
[4, p. 355].
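The behaviour described by (64), (65), and (66) can be seen on a simple family that is assumed here purely for illustration (it is not the sequence of the proof): on [0, 1] take eta_q(x) = sqrt(2) sin(q pi x)/(q pi). Each eta_q vanishes at both end points and its derivative has unit square integral, yet eta_q tends to zero uniformly and the derivatives tend to zero only weakly. The helper names below are hypothetical.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def eta(q, x):
    return math.sqrt(2) * math.sin(q * math.pi * x) / (q * math.pi)

def deta(q, x):
    return math.sqrt(2) * math.cos(q * math.pi * x)

def norm_sq(q):
    # Left side of the normalization (64) for this family
    return simpson(lambda x: deta(q, x) ** 2, 0.0, 1.0)

def sup_eta(q):
    # Sampled sup norm of eta_q, which should shrink as q grows
    return max(abs(eta(q, k / 1000)) for k in range(1001))

def pairing(q, phi):
    # Integral of eta_q' against a fixed test function phi
    return simpson(lambda x: deta(q, x) * phi(x), 0.0, 1.0)

for q in (1, 5, 40):
    print(q, norm_sq(q), sup_eta(q), pairing(q, lambda x: x * x))
```

Here the weak limit of the derivatives is zero while every term has unit norm, so the inequality (66) is strict for this family: the limit function has square integral 0, which is below the lim inf of 1.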

Inequality (66) will be used in the proof of the 
following lemma. 

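Lemma 15. Let $\{\eta^{(q)\prime}\}$ be a sequence in $L_2$ such
that (64) and (65) are satisfied and let $A_{ij}^{(q)}(x)$, $B_{ij}^{(q)}(x)$
$(i,j=1,2,\ldots,n)$ be continuous functions for almost
all q such that

$$A_{ij}^{(q)}\to A_{ij}^{(0)}\ \text{uniformly on }a\le x\le b,\qquad B_{ij}^{(q)}\to B_{ij}^{(0)}\ \text{uniformly on }a\le x\le b\qquad(i,j=1,2,\ldots,n).\tag{67}$$

Then if

$$W_i^{(q)}(x)=\int_a^x\big(A_{ij}^{(q)}\eta_j^{(q)}+B_{ij}^{(q)}\eta_j^{(q)\prime}\big)\,dt\qquad(i=1,2,\ldots,n;\ q=0,1,2,\ldots)\quad\text{(no sum on q)},$$

it follows that

$$W_i^{(q)}\to W_i^{(0)}\ \text{uniformly on }a\le x\le b,\qquad W_i^{(q)\prime}\to W_i^{(0)\prime}\ \text{weakly in }L_2\qquad(i=1,2,\ldots,n).\tag{68}$$

Consider

$$|W_i^{(q)}(x)-W_i^{(0)}(x)|\le\left|\int_a^x\big(A_{ij}^{(q)}\eta_j^{(q)}-A_{ij}^{(0)}\eta_j^{(q)}\big)dt\right|+\left|\int_a^x\big(A_{ij}^{(0)}\eta_j^{(q)}-A_{ij}^{(0)}\eta_j^{(0)}\big)dt\right|$$
$$+\left|\int_a^x\big(B_{ij}^{(q)}\eta_j^{(q)\prime}-B_{ij}^{(0)}\eta_j^{(q)\prime}\big)dt\right|+\left|\int_a^x\big(B_{ij}^{(0)}\eta_j^{(q)\prime}-B_{ij}^{(0)}\eta_j^{(0)\prime}\big)dt\right|.$$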

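The first of the four terms on the right of the sign of
inequality satisfies

$$\left|\int_a^x\big(A_{ij}^{(q)}\eta_j^{(q)}-A_{ij}^{(0)}\eta_j^{(q)}\big)dt\right|\le\int_a^b|A_{ij}^{(q)}-A_{ij}^{(0)}|\cdot|\eta_j^{(q)}|\,dt\le(b-a)\max|A_{ij}^{(q)}-A_{ij}^{(0)}|\,|\eta_j^{(q)}|.$$

By (67) max $|A_{ij}^{(q)}-A_{ij}^{(0)}|\to0$ for each $(i,j=1,2,\ldots,n)$.
By (65) $|\eta_j^{(q)}|$ is uniformly bounded for all j, q
under consideration. Thus the first term goes uniformly
to zero on $a\le x\le b$. The second term on
the right satisfies

$$\left|\int_a^x\big(A_{ij}^{(0)}\eta_j^{(q)}-A_{ij}^{(0)}\eta_j^{(0)}\big)dt\right|\le\int_a^b|A_{ij}^{(0)}|\cdot|\eta_j^{(q)}-\eta_j^{(0)}|\,dt\le\max|A_{ij}^{(0)}|\int_a^b|\eta_j^{(q)}-\eta_j^{(0)}|\,dt.$$

This goes uniformly to zero by (65). For the third
term we have

$$\left|\int_a^x\big(B_{ij}^{(q)}\eta_j^{(q)\prime}-B_{ij}^{(0)}\eta_j^{(q)\prime}\big)dt\right|\le\max|B_{ij}^{(q)}-B_{ij}^{(0)}|\int_a^b|\eta_j^{(q)\prime}|\,dt.$$

By the Schwarz inequality and (64)

$$\Big(\int_a^b|\eta_j^{(q)\prime}|\,dt\Big)^2\le(b-a)\int_a^b\eta_j^{(q)\prime\,2}\,dt\le(b-a)\qquad(q=1,2,\ldots).$$

Thus $\int_a^b|\eta_j^{(q)\prime}|\,dt$ is uniformly bounded for $(j=1,2,\ldots,n)$
$(q=1,2,\ldots)$. Consequently, one may conclude
from (67) that the third term goes uniformly to zero
on $a\le x\le b$.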

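Only the fourth term remains to be discussed. Let
j be temporarily held fixed. As is well known there
exists for each arbitrary $\epsilon>0$ a polynomial $P(x)$ such
that

$$\int_a^b\big(B_{ij}^{(0)}(x)-P(x)\big)^2dx<\frac{\epsilon^2}{16}.\tag{69}$$

By (64), (66), and the Minkowski inequality it is
true that

$$\Big(\int_a^b(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})^2dx\Big)^{1/2}\le\Big(\int_a^b\eta_j^{(q)\prime\,2}dx\Big)^{1/2}+\Big(\int_a^b\eta_j^{(0)\prime\,2}dx\Big)^{1/2}\le2.\tag{70}$$

Thus by use of Schwarz' inequality, (69), and (70)
one obtains

$$\left|\int_a^xB_{ij}^{(0)}(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dt-\int_a^xP(t)(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dt\right|\le\Big(\int_a^x(B_{ij}^{(0)}-P)^2dt\Big)^{1/2}\Big(\int_a^x(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})^2dt\Big)^{1/2}$$
$$\le\Big(\int_a^b(B_{ij}^{(0)}-P)^2dt\Big)^{1/2}\Big(\int_a^b(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})^2dt\Big)^{1/2}<\frac{\epsilon}{2}\tag{71}$$

for all x on $a\le x\le b$. Integration by parts yields

$$\int_a^xP(t)(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dt=P(x)(\eta_j^{(q)}-\eta_j^{(0)})-\int_a^xP'(t)(\eta_j^{(q)}-\eta_j^{(0)})\,dt.$$

Therefore, it follows from the uniform convergence
of $(\eta_j^{(q)}-\eta_j^{(0)})$ to zero that

$$\left|\int_a^xP(t)(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dt\right|<\frac{\epsilon}{2}\quad\text{uniformly on }a\le x\le b\tag{72}$$

for q sufficiently large. Formulas (71) and (72)
imply that

$$\left|\int_a^xB_{ij}^{(0)}(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dt\right|\to0\quad\text{uniformly on }a\le x\le b.$$

This completes the proof of the first statement of (68).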
Let l be any element of U and consider

$$\left|\int_a^bW_i^{(q)\prime}\,l\,dx-\int_a^bW_i^{(0)\prime}\,l\,dx\right|\le\max|l|\left\{\int_a^b\big|A_{ij}^{(q)}\eta_j^{(q)}-A_{ij}^{(0)}\eta_j^{(0)}\big|\,dx+\int_a^b\big|(B_{ij}^{(q)}-B_{ij}^{(0)})\,\eta_j^{(q)\prime}\big|\,dx\right\}+\left|\int_a^bl\,B_{ij}^{(0)}(\eta_j^{(q)\prime}-\eta_j^{(0)\prime})\,dx\right|.$$

By a repetition of the arguments already given above
the terms multiplied by max $|l|$ go to zero. The
factor l may be lumped with $B_{ij}^{(0)}$ without changing the
reasoning used in analyzing term four. Thus the
second statement of (68) is seen to be true.

The following corollary is a direct result of lemma 
15. 

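Corollary 1. Let the hypothesis of lemma 15
hold and let

$$U_i^{(q)}(x)\equiv W_i^{(q)}(x)-\frac{x-a}{b-a}\,W_i^{(q)}(b)\tag{73}$$

$$V_i^{(q)}(x)\equiv\int_a^xW_i^{(q)}(t)\,dt-\frac{x-a}{b-a}\int_a^bW_i^{(q)}(t)\,dt.\tag{74}$$

Then,

$$U_i^{(q)}\to U_i^{(0)}\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n)$$
$$V_i^{(q)}\to V_i^{(0)}\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n)$$
$$U_i^{(q)\prime}\to U_i^{(0)\prime}\ \text{weakly in }L_2\qquad(i=1,2,\ldots,n)$$
$$V_i^{(q)\prime}\to V_i^{(0)\prime}\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n).$$

Let

$$u_i(x,y;\eta)=\int_a^x\delta g_i\,dt-\frac{x-a}{b-a}\int_a^b\delta g_i\,dt\qquad(i=1,2,\ldots,n)$$
$$v_i(x,y;\eta)=\int_a^x\Big(\int_a^t\delta h_i\,ds\Big)dt-\frac{x-a}{b-a}\int_a^b\Big(\int_a^t\delta h_i\,ds\Big)dt\qquad(i=1,2,\ldots,n).\tag{75}$$

Then u and v are elements of U satisfying

$$\delta T_i(x,y;\eta)=u_i-v_i\qquad(i=1,2,\ldots,n).$$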



Since $\int_a^x\delta g_i(t,y^{(q)};\eta^{(q)})\,dt$ and $\int_a^x\delta h_i(t,y^{(q)};\eta^{(q)})\,dt$ are
functions of the type $W_i^{(q)}(x)$ defined in lemma 15 for
each admissible $y^{(q)}$ in Y and $\eta^{(q)\prime}$ in $L_2$, $u_i(x,y^{(q)};\eta^{(q)})$
and $v_i(x,y^{(q)};\eta^{(q)})$ are respectively of the types (73)
and (74). Consequently, if we take $\{y^{(q)}\}$ and $\{\eta^{(q)}\}$
to be the sequences which result from contradicting
lemma 14, the hypotheses of lemma 15 will be fulfilled
because of (62) and (65), and corollary 1 will then
imply

$$u_i(x,y^{(q)};\eta^{(q)})\to u_i(x,y^{(0)};\eta^{(0)})\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n)\tag{76}$$
$$v_i(x,y^{(q)};\eta^{(q)})\to v_i(x,y^{(0)};\eta^{(0)})\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n)\tag{77}$$
$$u_i'(x,y^{(q)};\eta^{(q)})\to u_i'(x,y^{(0)};\eta^{(0)})\ \text{weakly in }L_2\qquad(i=1,2,\ldots,n)\tag{78}$$
$$v_i'(x,y^{(q)};\eta^{(q)})\to v_i'(x,y^{(0)};\eta^{(0)})\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n).\tag{79}$$

In view of (63), formulas (76) and (77) yield the
result

$$\delta T(x,y^{(0)};\eta^{(0)})=\theta.\tag{80}$$

The limit function $\eta^{(0)}(x)$ is clearly a continuous
function on $a\le x\le b$ satisfying

$$\eta^{(0)}(a)=\eta^{(0)}(b)=\theta.$$

To show that $\eta^{(0)}$ is of class C', that is, that $\eta^{(0)}$ is in U,
observe that by differentiation of (80)

$$g_{iy_j}\eta_j^{(0)}+g_{iy_j'}\eta_j^{(0)\prime}=\int_a^x\delta h_i\,dt+\frac{1}{b-a}\int_a^b\Big[\delta g_i-\int_a^t\delta h_i\,ds\Big]dt.\tag{81}$$

Since $|g_{iy_j'}|\ne0$, (81) can be solved for $\eta^{(0)\prime}$. As all
other quantities involved in (81) are continuous, $\eta^{(0)\prime}$
must have the same property.

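By the hypothesis of nonconjugate end points the
only element of U satisfying (80) is the zero element.
Therefore $\eta^{(0)}=\theta$ and (78) and (79) may be amended
to read

$$u_i'(x,y^{(q)};\eta^{(q)})\to u_i'(x,y^{(0)};\eta^{(0)})=0\ \text{weakly in }L_2\qquad(i=1,2,\ldots,n)\tag{82}$$
$$v_i'(x,y^{(q)};\eta^{(q)})\to v_i'(x,y^{(0)};\eta^{(0)})=0\ \text{uniformly on }a\le x\le b\qquad(i=1,2,\ldots,n).\tag{83}$$

Consider

$$\lim_{q\to\infty}\int_a^b\delta T_i'(x,y^{(q)};\eta^{(q)})\,\delta T_i'(x,y^{(q)};\eta^{(q)})\,dx=\lim_{q\to\infty}\int_a^b\big(u_i^{(q)\prime}u_i^{(q)\prime}-2u_i^{(q)\prime}v_i^{(q)\prime}+v_i^{(q)\prime}v_i^{(q)\prime}\big)\,dx.\tag{84}$$

Although we have not previously defined all the
notations in (84) their meanings are obvious. By
(82) and (83) the terms involving $v_i'$ may be disregarded.
By lemma 15

$$\lim_{q\to\infty}\int_a^bu_i^{(q)\prime}u_i^{(q)\prime}\,dx=\lim_{q\to\infty}\Big[\int_a^b\delta g_i^{(q)}\delta g_i^{(q)}\,dx-\frac{1}{b-a}\int_a^b\delta g_i^{(q)}\,dx\int_a^b\delta g_i^{(q)}\,dx\Big]=\lim_{q\to\infty}\int_a^b\delta g_i^{(q)}\delta g_i^{(q)}\,dx.$$

Multiplying out, one obtains after dropping the
superscript (q) from the derivatives of $g_i$

$$\lim_{q\to\infty}\int_a^b\delta g_i\,\delta g_i\,dx=\lim_{q\to\infty}\int_a^b\big(g_{iy_j}\eta_j^{(q)}g_{iy_k}\eta_k^{(q)}+2g_{iy_j}\eta_j^{(q)}g_{iy_k'}\eta_k^{(q)\prime}+g_{iy_j'}\eta_j^{(q)\prime}g_{iy_k'}\eta_k^{(q)\prime}\big)\,dx$$
$$=\lim_{q\to\infty}\int_a^bg_{iy_j'}g_{iy_k'}\eta_j^{(q)\prime}\eta_k^{(q)\prime}\,dx=\lim_{q\to\infty}\int_a^bG_{jk}^{(q)}\eta_j^{(q)\prime}\eta_k^{(q)\prime}\,dx$$

since all terms involving $\eta^{(q)}$ go to zero.

Let $(\tilde A_{ij})$ denote the transpose of the matrix
$(A_{ij})$. Then (since by hypothesis det $(g_{iy_j'}^{(q)})\ne0$),

$$\det(G_{jk}^{(q)})=\det(\tilde g_{iy_j'}^{(q)})\,\det(g_{iy_j'}^{(q)})=[\det(g_{iy_j'}^{(q)})]^2>0.$$

Hence $G_{jk}^{(q)}$ is a real, symmetric, positive definite
matrix for each $q=1,2,\ldots$. Therefore, it follows
that

$$\lim_{q\to\infty}\int_a^b\delta T_i'(x,y^{(q)};\eta^{(q)})\,\delta T_i'(x,y^{(q)};\eta^{(q)})\,dx=\lim_{q\to\infty}\int_a^bG_{jk}^{(q)}\eta_j^{(q)\prime}\eta_k^{(q)\prime}\,dx\ge C>0.$$

However, (63) implies that this last limit is identically
zero. Consequently, our assumption leads to
a contradiction. Therefore there must exist at least
one neighborhood of $y^{(0)}$ in which all elements have
nonconjugate end points.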

11. A Solution of the Fixed End Point Problem 

To conclude the discussion of Newton's method,
we wish to point out that with certain additional
hypotheses we can make use of our previous results
to obtain a solution of the fixed end point problem
of the Calculus of Variations.

Let $f(x,y,y')$ be the integrand of the integral (1). A
strengthened Legendre condition is said to hold in the
region R if

$$f_{y_i'y_j'}(x,y,y')\,\pi_i\pi_j>0\tag{85}$$

for all $(x,y,y')$ in R and all $(\pi)\ne0$. This implies
that the determinant $|f_{y_i'y_j'}|\ne0$ in R and is thus
stronger than our previous condition that the
determinant $|g_{iy_j'}|$ never vanishes in R.

The following terminology is well known and will
be helpful. An arc y is said to satisfy a strengthened
Jacobi condition if it has nonconjugate end points
and has no subarc defined on $a\le x\le\xi<b$ which has
conjugate end points.

Let (2) be replaced by the Euler equation (13).
Let the region R be convex in y' and let the strengthened
Legendre condition hold everywhere in R.
The following theorem is then true.

Theorem 6. Let $y^{(0)}$ be an admissible element of
Y having nonconjugate end points. Then if there
exists a sufficiently small constant $D>0$ such that
$\|T(x;y^{(0)})\|<D$, the sequence of estimates derived from
Newton's method will converge quadratically to a
unique arc y of class C'' which lies within a predetermined
neighborhood of $y^{(0)}$, joins the end points of
$y^{(0)}$, and satisfies the Euler equations. Moreover, if y
is such that no interior point of $a\le x\le b$ is conjugate to
$x=a$, y will afford a strong relative minimum to $I(y)$.

By theorem 4, Newton's method converges quadratically
to an arc y of class C'', which joins the end
points of $y^{(0)}$ and satisfies the Euler equations. The
arc y also has the property of nonconjugate end
points. This follows from lemma 14, which implies
that the constant $\alpha_9$ mentioned in the proof of
theorem 4 can be chosen to have the additional
property that no element of $(y^{(0)})_{\alpha_9}$ has conjugate
end points. Since by hypothesis no subarc of y with
initial point at $x=a$ has conjugate end points, a
strengthened Jacobi condition is satisfied along y.
As it has also been assumed that a strengthened
Legendre condition holds throughout R and that R
is convex in y', we may conclude (see Bliss [2, p. 42,
corollary 16.1]) that y affords a strong relative
minimum to $I(y)$.

The following theorem gives further criteria that 
insure the convergence of the sequence of estimates 
derived from Newton's method to a solution of the 
fixed end point problem. 

Theorem 7. Let a minimum $y^*$ exist and have
nonconjugate end points. Then there exists a constant
$\alpha>0$ such that for any initial estimate $y^{(0)}$ taken from
$(y^*)_\alpha$ the conclusions of theorem 6 hold.

Upon recalling that a necessary condition for $y^*$
to be a minimum is $T(x;y^*)=\theta$, one proceeds with
the proof just as in theorem 5.

12. A Second Method for the Effective Solution of the Fixed End Point Problem

In section 2 it was remarked that in the application
of Newton's method, one must solve the Euler
equations of each term of a sequence of integrals
whose integrands are quadratic in y and y'. Thus
it is not inappropriate to introduce at this point a
second method for solving fixed end point problems
which will prove to be particularly applicable to the
case of a quadratic integrand. In an actual computation
a combination of Newton's method and the
method to be presented below might prove to
be advantageous.

Henceforth the symbol Y will denote the class of
n-tuples of real valued functions on $a\le x\le b$ which
have Lebesgue square integrable derivatives. Use
was made of this class of functions in section 10.
The symbol U will again denote the subclass of
elements of Y which vanish at the end points of
$a\le x\le b$. Throughout the remainder of the paper
we assume that all functions of Y are admissible and
that the underlying region R is convex in y and y'.
Our previous assumptions as to the nature of the
integrand $f(x,y,y')$ are retained. Moreover, we
assume that f and its derivatives are integrable
along arcs in Y.

Assume that after starting from some initial
estimate $y^{(0)}$ in Y one has obtained an "improved"
estimate of a solution to the Euler equations, $y^{(i)}$,
also in Y. One then chooses the (i+1)st estimate to be

$$y^{(i+1)}=y^{(i)}-a_i\,\eta^{(i)}\quad\text{(no sum on i)},\tag{86}$$

where $\eta^{(i)}$ is an element of U determined by the
equation

$$\eta^{(i)}=T(x;y^{(i)}).\tag{87}$$

Here T is formed for the Euler equations as illustrated
in section 2, and $a_i$ is a real number.

A procedure similar to that just described has
been announced by L. V. Kantorovitch [5] for the
case of an integral whose integrand is quadratic in y
and y'. However, his paper presents no proofs.
Curry [6] has described an analogous process that
can be used to minimize a function of n real variables.
Other references may be found in the last paper cited.
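The iteration (86) and (87) can be sketched on a grid. Everything specific below is an assumption chosen for illustration: the integrand f = (y'^2 + y^2)/2 on [0, 1] with end points (0, 0) and (1, 1), for which f_{y'} = y' and f_y = y; the trapezoid and midpoint quadratures; a constant step a_i = 1; and the names `T_of`, `I_of` and `gradient_method`. The operator T is taken in the integral form that appears in the proof of lemma 16, namely the integral of f_{y'} minus the running integral of f_y, corrected by the multiple of (x - a)/(b - a) that makes it vanish at x = b.

```python
import math

def T_of(y, h):
    """Discrete T(x; y) for the assumed integrand f = (y'^2 + y^2) / 2."""
    n = len(y) - 1
    fyp = [(y[k + 1] - y[k]) / h for k in range(n)]   # f_{y'} = y' at midpoints
    H = [0.0] * (n + 1)                               # H[k] = int_a^{x_k} f_y dt
    for k in range(n):
        H[k + 1] = H[k] + h * (y[k] + y[k + 1]) / 2
    m = [fyp[k] - (H[k] + H[k + 1]) / 2 for k in range(n)]
    S = [0.0] * (n + 1)                               # running integral of m
    for k in range(n):
        S[k + 1] = S[k] + h * m[k]
    # Subtract (x - a)/(b - a) times the full integral so that T vanishes at b.
    return [S[j] - (j / n) * S[n] for j in range(n + 1)]

def I_of(y, h):
    """Midpoint-rule value of the integral I(y) for the same integrand."""
    return sum(h * (((y[k + 1] - y[k]) / h) ** 2
                    + ((y[k] + y[k + 1]) / 2) ** 2) / 2
               for k in range(len(y) - 1))

def gradient_method(n=100, a_step=1.0, iters=30):
    h = 1.0 / n
    y = [k / n for k in range(n + 1)]                 # straight line joining the ends
    values = [I_of(y, h)]
    for _ in range(iters):
        eta = T_of(y, h)                              # (87)
        y = [yi - a_step * ei for yi, ei in zip(y, eta)]   # (86)
        values.append(I_of(y, h))
    return y, values

y_final, values = gradient_method()
exact = [math.sinh(k / 100) / math.sinh(1.0) for k in range(101)]
print(values[0], values[-1], max(abs(u - v) for u, v in zip(y_final, exact)))
```

For this assumed integrand the Euler equation is y'' = y, with solution sinh x / sinh 1, which gives a reference for the iterates. Since eta vanishes at both ends, every estimate joins the same two points. The step a = 1 is a guess that worked for this example; a rough estimate with the inequality relating the integral of eta^2 to that of eta'^2 on the unit interval suggests the admissible range here extends to about 1.8, but that figure should be treated as an assumption.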

In order to consider the convergence of the
sequence defined by (86) it is helpful to examine the
expression $I(y)-I(y-a\eta)$. We may state the
following lemma.

Lemma 16. Let y be an element of Y and let $\eta$ be
given by (87). Then,

$$I(y)-I(y-a\eta)=a\int_a^b\eta_i'\eta_i'\,dx-a^2\int_a^bW\,dx\tag{88}$$

where

$$W=A_{ij}\eta_i\eta_j+2B_{ij}\eta_i\eta_j'+C_{ij}\eta_i'\eta_j'$$
$$A_{ij}=\int_0^1(1-\mu)\,f_{y_iy_j}(x,y-a\mu\eta,y'-a\mu\eta')\,d\mu$$

and $B_{ij}$, $C_{ij}$ are obtained from the formula for $A_{ij}$ by
replacing $f_{y_iy_j}$ by $f_{y_iy_j'}$ and $f_{y_i'y_j'}$ respectively.

An application of Taylor's theorem results in

$$I(y)-I(y-a\eta)=a\int_a^b\big[f_{y_i}(x,y,y')\,\eta_i+f_{y_i'}(x,y,y')\,\eta_i'\big]dx-a^2\int_a^bW\,dx.$$

By integration by parts

$$\int_a^bf_{y_i}\eta_i\,dx=-\int_a^b\Big[\int_a^xf_{y_i}\,dt+C_i\Big]\eta_i'\,dx;$$

and since

$$\int_a^bC_i\eta_i'\,dx=C_i[\eta_i(b)-\eta_i(a)]=0,$$

the choice of $C_i$ is arbitrary. Accordingly, we choose

$$C_i=\frac{1}{b-a}\int_a^b\Big[f_{y_i'}(t,y,y')-\int_a^tf_{y_i}(s,y,y')\,ds\Big]dt.$$

Thus one may write

$$I(y)-I(y-a\eta)=a\int_a^bT_i'\eta_i'\,dx-a^2\int_a^bW\,dx=a\int_a^b\eta_i'\eta_i'\,dx-a^2\int_a^bW\,dx.$$

The next lemma follows directly from formula (88). 

Lemma 17. Assume that for every number $\nu$ there
exists a number $K>0$ and a number $a^*$ such that if
$I(y)\le\nu$ and $0<a<a^*$ then

$$\int_a^bW\,dx\le K\int_a^b\eta_i'\eta_i'\,dx.\tag{89}$$

Then in case $I(y)\le\nu$ and $0<a<a^*$ one has the
relation

$$I(y)-I(y-a\eta)\ge a(1-aK)\int_a^b\eta_i'\eta_i'\,dx.\tag{90}$$
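The step from (88) to (90) is short and can be written out. Substituting the bound (89) into (88), and using that $a^2>0$, gives

```latex
\begin{aligned}
I(y)-I(y-a\eta)
  &= a\int_a^b \eta_i'\eta_i'\,dx \;-\; a^2\int_a^b W\,dx \\
  &\ge a\int_a^b \eta_i'\eta_i'\,dx \;-\; a^2 K\int_a^b \eta_i'\eta_i'\,dx \\
  &= a\,(1-aK)\int_a^b \eta_i'\eta_i'\,dx .
\end{aligned}
```

In particular the right side is positive whenever $0<a<1/K$ and $\eta$ is not the zero element, which is the range of step sizes used in the convergence argument.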

Formula (90) helps to prove the following theorem. 

Theorem 8. Let there exist finite 7iumbers v, 
K^O, a* such that if L(y)<v and 0<Ca<a'^, formula 
{89) holds. Let y^^^ be such that I{y^''^)<v and] let 
a and a be chosen so that 0<^a<a<iljK. Then if 
I(y) is bounded below on the class of arcs in Y joining 
the end points of y^^\ the sequence of estimates {y^^^} 
defined by (86) and (87) is such that {I(y^^^)} is 
convergent. Furthermore, 

T(x;y^'^)-^e uniformly ona<x<b (91) 

T(x,y^'^)-^e inU. (92) 

That a may be chosen as specified in the hypoth- 
esis is a consequence of the fact that K is finite. As 
a result of such a choice of a, one obtains the 
inequality 

/(2/<°))>/(2/<°'-aoV'")^/(2/("') 

from (90) by setting y = y^^\ a = ao- Since L(y^^^)<Cvt 
the argument can be repeated provided rj^^^^d. 
Hence a sequence {y^^^} such that 



/(2/"')>/(2/"+") ii = 0, 1,2, 



.) 



is constructed. Due to the hypothesis that I(y) is 
bounded below the sequence of definite integrals 
converges. 
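By (90) and the way $\{\alpha_i\}$ is chosen

$$I(y^{(i)}) - I(y^{(i+1)}) \ge \alpha_i (1 - \alpha_i K) \int_a^b \eta_j^{(i)\prime}\,\eta_j^{(i)\prime}\,dx \ge \underline{\alpha}\,\bigl(1 - K \sup_i \alpha_i\bigr) \int_a^b \eta_j^{(i)\prime}\,\eta_j^{(i)\prime}\,dx > 0, \tag{93}$$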

where here and in similar expressions that occur 
below there is no sum on the index that denotes 
which step of the iteration process we are considering. 
As the left-hand side of (93) can be made as small as 
desired by choosing i sufficiently large, we may 
conclude that 

$$\lim_{i \to \infty} \int_a^b \eta_j^{(i)\prime}\,\eta_j^{(i)\prime}\,dx = 0. \tag{94}$$

Conclusion (92) follows at once from (94). Since 
(91) is a consequence of (92), the theorem is proved. 
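The iteration analysed in theorem 8 can be tried numerically. The following is a minimal sketch, not the paper's procedure: it assumes the update has the form $y^{(i+1)} = y^{(i)} - \alpha\,\eta^{(i)}$ with $\eta' = T'(x; y^{(i)})$ and $\eta(a) = \eta(b) = 0$, as the identity preceding lemma 17 suggests. The midpoint discretization, the fixed step `alpha`, the function names, and the test integrand $f = y'^2 + y^2$ are all illustrative assumptions.

```python
import numpy as np

def functional(x, y, f):
    """Midpoint-rule value of I(y) = integral of f(x, y, y') dx."""
    h = x[1] - x[0]
    xm = 0.5 * (x[:-1] + x[1:])
    ym = 0.5 * (y[:-1] + y[1:])
    yp = np.diff(y) / h
    return h * np.sum(f(xm, ym, yp))

def gradient_direction(x, y, f_y, f_yp):
    """Return eta on the grid with eta' = T'(x; y) and eta(a) = eta(b) = 0."""
    h = x[1] - x[0]
    xm = 0.5 * (x[:-1] + x[1:])
    ym = 0.5 * (y[:-1] + y[1:])
    yp = np.diff(y) / h
    fy = f_y(xm, ym, yp)
    # Integral of f_y from a up to each interval midpoint.
    F = h * (np.cumsum(fy) - 0.5 * fy)
    Tp = f_yp(xm, ym, yp) - F
    Tp = Tp - Tp.mean()          # subtract c so that T' integrates to zero
    return np.concatenate(([0.0], h * np.cumsum(Tp)))

def gradient_descent(x, y0, f, f_y, f_yp, alpha, steps):
    """Repeat y <- y - alpha * eta and record I(y) after every step."""
    y = y0.copy()
    values = [functional(x, y, f)]
    for _ in range(steps):
        y = y - alpha * gradient_direction(x, y, f_y, f_yp)
        values.append(functional(x, y, f))
    return y, values

# Illustrative quadratic integrand f = y'^2 + y^2 on [0, 1], y(0) = 0, y(1) = 1.
x = np.linspace(0.0, 1.0, 101)
y, values = gradient_descent(
    x, x.copy(),
    f=lambda x, y, p: p**2 + y**2,
    f_y=lambda x, y, p: 2.0 * y,
    f_yp=lambda x, y, p: 2.0 * p,
    alpha=0.5, steps=8)
```

For this integrand $W = \eta'^2 + \eta^2$, so a constant $K$ slightly above 1 satisfies (89) and the step 0.5 lies below $1/K$. The recorded values of $I$ should decrease, and the iterates should approach $\sinh x / \sinh 1$, the solution of $y'' = y$ with these end values.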
The problem of determining general conditions 
under which an assumption of the type made in 
lemma 17 is valid seems to be a difficult one. How- 
ever, special cases may be considered in which K is 
independent of v and a* is arbitrary. A rather 
trivial example of this occurs when the integrand is 
given by 

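$$f(x,y,y') = g(x)\sqrt{1 + y'^2}, \qquad g(x) > 0.$$

If $2K \ge |g(x)|$ on $a \le x \le b$, we have

$$\int_a^b W\,dx = \int_a^b \int_0^1 (1 - \mu)\,\frac{g(x)\,\eta'^2}{\left[1 + (y' - \alpha\mu\eta')^2\right]^{3/2}}\,d\mu\,dx \le K \int_a^b \eta'^2\,dx.$$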

A much more important case is that in which the 
integrand is quadratic, that is, 

$$f(x,y,y') = A_{ij}(x)\,y_i y_j + 2B_{ij}(x)\,y_i y_j' + C_{ij}(x)\,y_i' y_j' + D_i(x)\,y_i + E_i(x)\,y_i' + F(x). \tag{95}$$

For $f(x,y,y')$ given by (95) we obtain upon integrating
out $\mu$

$$\int_a^b W\,dx = \int_a^b \left[ A_{ij}(x)\,\eta_i\eta_j + 2B_{ij}(x)\,\eta_i\eta_j' + C_{ij}(x)\,\eta_i'\eta_j' \right] dx \le K \int_a^b \eta_i'\,\eta_i'\,dx$$

for a suitably chosen constant $K$ provided $\eta_i(a) =
\eta_i(b) = 0$. This last result follows from the Schwarz
inequality and the fact that the integrand is a
quadratic form in $\eta$ and $\eta'$ having continuous
coefficients.
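One explicit choice of $K$ for the quadratic case can be sketched as follows. The constants $M_A$, $M_B$, $M_C$ are notation introduced only for this note: they are assumed to bound the coefficient matrices on $a \le x \le b$, in the sense that $|A_{ij}\xi_i\zeta_j| \le M_A|\xi||\zeta|$ for all vectors $\xi$, $\zeta$, and similarly for $B$ and $C$. Here $|\eta|^2 = \eta_i\eta_i$, and the first line is applied to each component $\eta_i$ separately before summing.

```latex
\begin{aligned}
\eta_i(x)^2 &= \Bigl(\int_a^x \eta_i'\,dt\Bigr)^2 \le (x-a)\int_a^b \eta_i'^2\,dt
  \;\Longrightarrow\;
  \int_a^b |\eta|^2\,dx \le \frac{(b-a)^2}{2}\int_a^b |\eta'|^2\,dx, \\
\int_a^b |\eta|\,|\eta'|\,dx
  &\le \Bigl(\int_a^b |\eta|^2\,dx\Bigr)^{1/2}\Bigl(\int_a^b |\eta'|^2\,dx\Bigr)^{1/2}
  \le \frac{b-a}{\sqrt{2}}\int_a^b |\eta'|^2\,dx, \\
\int_a^b W\,dx
  &\le \Bigl[\,M_A\,\frac{(b-a)^2}{2} + \sqrt{2}\,M_B\,(b-a) + M_C\Bigr]\int_a^b |\eta'|^2\,dx .
\end{aligned}
```

The bracket on the last line is an admissible $K$. It does not depend on $\nu$ or $\alpha$, which is the feature singled out in the text.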



By setting $y = y^{(0)}$, $\eta = y - y^{(0)}$, $\alpha = -1$ in the
computations employed to establish lemma 16, one
obtains the formula

$$I(y) = I(y^{(0)}) + \int_a^b T_i'(x; y^{(0)})\,\eta_i'\,dx + \int_a^b W\,dx, \tag{96}$$

which is true for all $y$ in $Y$ which join the end points
of $y^{(0)}$. This equation is helpful in the proofs to be
given below.

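Lemma 18. Let there exist a constant $k > 0$ such that

$$\int_a^b W\,dx \ge k \int_a^b \eta_i'\,\eta_i'\,dx \tag{97}$$

for all $\eta$ in $L_2$ and all $y$ in $Y$. Then $I(y)$ is bounded
below on the class of elements of $Y$ joining the end
points of some arbitrary fixed $y^{(0)}$ in $Y$. Furthermore,
given a constant $\nu$ there exists a constant $M$ such that
for each $y$ in $Y$ joining the end points of $y^{(0)}$ for which
$I(y) \le \nu$,

$$\int_a^b \eta_i'\,\eta_i'\,dx \le M. \tag{98}$$

In (98) we have $\eta = y - y^{(0)}$.

By Schwarz' inequality

$$\left| \int_a^b T_i'(x; y^{(0)})\,\eta_i'\,dx \right| \le \left( \int_a^b T_i'(x; y^{(0)})\,T_i'(x; y^{(0)})\,dx \right)^{1/2} \left( \int_a^b \eta_i'\,\eta_i'\,dx \right)^{1/2} = h \left( \int_a^b \eta_i'\,\eta_i'\,dx \right)^{1/2}.$$

Thus by (96) and (97)

$$I(y) \ge I(y^{(0)}) - h \left( \int_a^b \eta_i'\,\eta_i'\,dx \right)^{1/2} + k \int_a^b \eta_i'\,\eta_i'\,dx$$

provided $y$ joins the end points of $y^{(0)}$. This proves
that $I(y)$ is bounded below on the class of functions
joining the end points of $y^{(0)}$. The remainder of the
lemma is also a ready consequence of this last
inequality.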
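One way to finish the argument explicitly is sketched below. The shorthand $t = \bigl(\int_a^b \eta_i'\eta_i'\,dx\bigr)^{1/2}$ and $I_0 = I(y^{(0)})$ is introduced only for this note, and $h$ is the constant from the Schwarz estimate, so that (96) and (97) give $I(y) \ge I_0 - ht + kt^2$.

```latex
\begin{aligned}
I(y) &\ge I_0 - h\,t + k\,t^2 \;\ge\; I_0 - \frac{h^2}{4k}
  && \text{(the quadratic in } t \text{ is smallest at } t = h/2k\text{)}, \\
I(y) \le \nu
  &\;\Longrightarrow\; k\,t^2 - h\,t - (\nu - I_0) \le 0
  \;\Longrightarrow\; t \le \frac{h + \sqrt{h^2 + 4k(\nu - I_0)}}{2k}, \\
M &= \left( \frac{h + \sqrt{h^2 + 4k(\nu - I_0)}}{2k} \right)^{2}.
\end{aligned}
```

The first line gives the lower bound for $I(y)$, and the last gives a constant $M$ for (98). The square root is real whenever at least one admissible $y$ with $I(y) \le \nu$ exists, since then $\nu - I_0 \ge -h^2/4k$.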

Condition (97) does not seem to be very widely
applicable. However, in case the integrand is
quadratic the strengthened Jacobi condition in conjunction
with the strengthened Legendre condition
implies the existence of a constant $k > 0$ such that
(97) holds.

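Lemma 19. Let $\{y^{(q)}\}$ be a sequence in $Y$ such that

$$y_i^{(q)} \to \bar{y}_i \text{ uniformly on } a \le x \le b \quad (i = 1, 2, \ldots, n), \tag{99}$$

$$y_i^{(q)\prime} \to \bar{y}_i' \text{ almost uniformly} \quad (i = 1, 2, \ldots, n), \tag{100}$$

$$T_i'(x; y^{(q)}) \to 0 \text{ almost uniformly} \quad (i = 1, 2, \ldots, n). \tag{101}$$

Then

$$T_i'(x; \bar{y}) = 0 \text{ almost everywhere} \quad (i = 1, 2, \ldots, n). \tag{102}$$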
Let $I$ denote the interval $a \le x \le b$. Since $\bar{y}'$ is in
$L_2$ it is summable and hence finite almost everywhere.
Hence given $\epsilon > 0$ there exists by (100) a set $S_1$ of
measure less than $\epsilon/2$ such that $y_i^{(q)\prime} \to \bar{y}_i'$ uniformly
on $I - S_1$ for $(i = 1, 2, \ldots, n)$. Of course, $S_1$ must contain
all the infinities of $\bar{y}'$. The summability of $\bar{y}'$
also implies the existence of a set $S_2$ of measure less
than $\epsilon/4$ such that $\bar{y}_i'$ is bounded on $I - S_2$ for
$(i = 1, 2, \ldots, n)$. Therefore, if $S_3$ is an open set of
measure less than $\epsilon$ containing $S_1 + S_2$, the following
statements are valid:

(a) $I - S_3$ is closed and bounded,

(b) $\bar{y}_i'$ is bounded on $I - S_3$ $(i = 1, 2, \ldots, n)$,

(c) $y_i^{(q)} \to \bar{y}_i$ uniformly on $I - S_3$ $(i = 1, 2, \ldots, n)$,

(d) $y_i^{(q)\prime} \to \bar{y}_i'$ uniformly on $I - S_3$ $(i = 1, 2, \ldots, n)$.

By (a) through (d) it can be concluded that

$$f_{y_i'}(x, y^{(q)}, y^{(q)\prime}) \to f_{y_i'}(x, \bar{y}, \bar{y}') \text{ uniformly on } I - S_3$$

and similarly for $f_{y_i}$. Hence it readily is verified that
$T_i'(x; y^{(q)}) \to T_i'(x; \bar{y})$ uniformly on $I - S_3$ for
$(i = 1, 2, \ldots, n)$, that is, almost uniformly and thus almost
everywhere. Conclusion (102) is then obtainable
from (101).

If one wishes, he may express (102) in the form
$T(x; \bar{y}) = \theta$. This follows directly by integration.

Lemma 20. Let $\bar{y}$ be an element of $Y$ satisfying
(102). Then if there exists $k > 0$ such that (97) holds,
$\bar{y}$ affords a unique minimum to $I(y)$ on the class of
functions in $Y$ which join the end points of $\bar{y}$.

Let $y$ be any element of $Y$ joining the end points
of $\bar{y}$. By (96)

$$I(y) - I(\bar{y}) = \int_a^b T_i'(x; \bar{y})\,(y_i' - \bar{y}_i')\,dx + \int_a^b W\,dx.$$

By hypothesis $T_i'(x; \bar{y}) = 0$ almost everywhere and
$y_i' - \bar{y}_i'$ is in $L_2$. Thus the first integral on the right
is zero by Hölder's inequality. Therefore, if $y \ne \bar{y}$,

$$I(y) - I(\bar{y}) = \int_a^b W\,dx \ge k \int_a^b (y_i' - \bar{y}_i')(y_i' - \bar{y}_i')\,dx > 0.$$

Consequently,

$$I(y) > I(\bar{y}). \tag{103}$$

Assume that $y$ also satisfies (102). Then by reversing
the roles of $y$ and $\bar{y}$ one obtains $I(\bar{y}) > I(y)$.
This contradicts (103), and so implies that the assumption
that $y \ne \bar{y}$ satisfies (102) is false. Hence $y$
cannot be a minimum.

Theorem 9. Let there exist finite numbers $\nu$, $K > 0$,
$\alpha^*$ such that if $I(y) \le \nu$ and $0 < \alpha < \alpha^* < 1/K$, formula
(89) holds. Let $y^{(0)}$ be such that $I(y^{(0)}) \le \nu$ and let $\underline{\alpha}$
and $\bar{\alpha}$ be chosen so that $0 < \underline{\alpha} < \bar{\alpha} < \alpha^*$. Then if there
exists a constant $k > 0$ such that (97) is satisfied, the
conclusions of theorem 8 are satisfied. Furthermore,
there exists a function $\bar{y}$ such that for the sequence
$\{y^{(i)}\}$ described in theorem 8 we have

$$y^{(i)} \to \bar{y} \text{ uniformly on } a \le x \le b,$$

$$y^{(i)\prime} \to \bar{y}' \text{ in the mean of order 2}.$$

The limit $\bar{y}$ affords a unique minimum to $I(y)$ on the
class of functions in $Y$ joining the end points of $y^{(0)}$.

By lemma 18, $I(y)$ is bounded below on the class
of $y$ in $Y$ joining the end points of $y^{(0)}$. Since all of
the other hypotheses of theorem 8 have been assumed,
the conclusions of that theorem must be valid.

As the sequence $\{y^{(i)}\}$ of theorem 8 is such that
each $y^{(i)}$ joins the end points of $y^{(0)}$ and

$$I(y^{(i)}) < I(y^{(0)}) \le \nu \qquad (i = 1, 2, \ldots),$$

formula (98) of lemma 18 is applicable to each term.
This formula implies the existence of $M > 0$ such that
for all $n$, $m$

$$\int_a^b \eta_i^{(n,m)\prime}\,\eta_i^{(n,m)\prime}\,dx \le M,$$

where $\eta^{(n,m)} = y^{(n)} - y^{(m)}$. For $n > m$, $I(y^{(n)}) - I(y^{(m)})$
is negative. Thus from substitution in the main
formula of the proof of lemma 18 it follows that

$$0 < k \int_a^b \eta_i^{(n,m)\prime}\,\eta_i^{(n,m)\prime}\,dx < h_m \left( \int_a^b \eta_i^{(n,m)\prime}\,\eta_i^{(n,m)\prime}\,dx \right)^{1/2}, \tag{104}$$

where

$$h_m = \left( \int_a^b T_i'(x; y^{(m)})\,T_i'(x; y^{(m)})\,dx \right)^{1/2} \to 0, \qquad n > m, \quad m \to \infty.$$

Consequently, the existence of $\bar{y}$ such that $y^{(i)\prime} \to \bar{y}'$
in the mean of order 2 follows from (104). This last
result implies $y^{(i)} \to \bar{y}$ uniformly on $a \le x \le b$. By
recalling (92) and taking subsequences one finally
obtains a subsequence $\{y^{(q)}\}$ such that

$$y^{(q)} \to \bar{y} \text{ uniformly on } a \le x \le b,$$

$$y^{(q)\prime} \to \bar{y}' \text{ almost uniformly},$$

$$T'(x; y^{(q)}) \to \theta \text{ almost uniformly}.$$

Consequently (by lemma 19) $\bar{y}$ satisfies (102). That
$\bar{y}$ is unique is a consequence of lemma 20. Hence the
theorem is proved. It is not difficult to show that 
y{x) is of class c^. 

It is clear that these results apply to the case when
$f$ is a polynomial of degree two in $y'$.

13. An Example of Numerical Computation 
Consider the problem of minimizing the integral

$$I(y) = \int_{x_1}^{x_2} y\sqrt{1 + y'^2}\,dx, \qquad y > 0,$$

whose Euler equation is

$$y'' = \frac{1 + y'^2}{y}. \tag{105}$$

To apply Newton's method write (105) as

$$T \equiv 1 + y'^2 - y\,y'' = 0.$$

Then

$$\delta T = 2y'v' - y''v - y\,v'',$$

and the differential equation $T + \delta T = 0$ can be put
into the form

$$v'' = Av + Bv' + C,$$

where

$$A = -\frac{y''}{y}, \qquad B = \frac{2y'}{y}, \qquad C = \frac{1 + y'^2}{y} - y''.$$
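The Newton step just described can be sketched in code. The paper does not say how the linear equation for $v$ was solved, so everything below is an assumption made for illustration: central differences on a uniform grid, the boundary condition $v = 0$ at both ends (so every iterate keeps the prescribed end values), the update $y \leftarrow y + v$, and the name `newton_step`. The starting curve $y^{(0)} = 3.086$ on $-2 \le x \le 2$ is the one used for table 1.

```python
import numpy as np

def newton_step(x, y):
    """One Newton step for 1 + y'^2 - y*y'' = 0.

    Solves v'' = A v + B v' + C with v = 0 at both end points and
    returns the updated curve y + v together with the residual C of y.
    """
    h = x[1] - x[0]
    yi = y[1:-1]                                  # interior values
    yp = (y[2:] - y[:-2]) / (2.0 * h)             # central first difference
    ypp = (y[2:] - 2.0 * yi + y[:-2]) / h**2      # central second difference
    A = -ypp / yi
    B = 2.0 * yp / yi
    C = (1.0 + yp**2) / yi - ypp
    n = yi.size
    j = np.arange(n)
    M = np.zeros((n, n))
    # Row j encodes (v[j+1] - 2 v[j] + v[j-1]) / h^2
    #               - A[j] v[j] - B[j] (v[j+1] - v[j-1]) / (2h) = C[j].
    M[j, j] = -2.0 / h**2 - A
    M[j[1:], j[:-1]] = 1.0 / h**2 + B[1:] / (2.0 * h)
    M[j[:-1], j[1:]] = 1.0 / h**2 - B[:-1] / (2.0 * h)
    v = np.zeros_like(y)
    v[1:-1] = np.linalg.solve(M, C)
    return y + v, C

x = np.linspace(-2.0, 2.0, 201)
y = np.full_like(x, 3.086)        # constant initial estimate
iterates = [y]
for _ in range(10):
    y, residual = newton_step(x, y)
    iterates.append(y)
```

With the constant starting curve the first step reduces to $v'' = 1/3.086$, so the first iterate at $x = 0$ is $3.086 - 2/3.086 \approx 2.438$. Later iterates should settle near $2\cosh(x/2)$, up to the rounding of the end value 3.086 and the discretization error of the grid.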

The general solution of (105) is given by

$$y = b\cosh\frac{x - a}{b}. \tag{106}$$

Given two points $(x_1, y_1)$ and $(x_2, y_2)$ three cases can
occur.

(1) Two catenaries (106) join the given points.
In this case one will be minimizing and the other will
not.

(2) One catenary (106) joins the given points.
This catenary is not minimizing.

(3) No catenary (106) joins the given points.
The examples for computation were selected with
this in mind.
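The three cases can be checked numerically when the end points are symmetric about $x = 0$, so that the catenary has the form $y = b\cosh(x/b)$. The sketch below is restricted to that symmetric situation, which is an assumption made for simplicity; the function names and the bisection brackets are illustrative choices. It counts the values of $b$ for which $b\cosh(w/b)$ equals a prescribed height at $x = \pm w$.

```python
import math

def bisect(g, lo, hi, tol=1e-10):
    """Root of g in [lo, hi] by bisection; g(lo) and g(hi) must differ in sign."""
    glo = g(lo)
    if glo * g(hi) > 0:
        raise ValueError("no sign change on the bracket")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        gmid = g(mid)
        if glo * gmid <= 0:
            hi = mid
        else:
            lo, glo = mid, gmid
    return 0.5 * (lo + hi)

def symmetric_catenaries(half_width, height):
    """Parameters b with b*cosh(half_width/b) = height, in increasing order."""
    # b*cosh(w/b) is smallest where u*tanh(u) = 1, with u = w/b.
    u_star = bisect(lambda u: u * math.tanh(u) - 1.0, 0.5, 2.0)
    b_star = half_width / u_star

    def g(b):
        return b * math.cosh(half_width / b) - height

    if g(b_star) > 0:
        return []                       # case (3): no catenary
    if g(b_star) == 0:
        return [b_star]                 # case (2): exactly one catenary
    lower = bisect(g, half_width / 50.0, b_star)
    upper = bisect(g, b_star, b_star + height)
    return [lower, upper]               # case (1): two catenaries

roots = symmetric_catenaries(2.0, 3.086)
none = symmetric_catenaries(2.0, 2.0)
```

For the end points $(\pm 2, 3.086)$ the two roots come out close to 1.404 and 2, and lowering the end height to 2 leaves no catenary at all.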

Example I:

The points $(x_1, y_1)$, $(x_2, y_2)$ are respectively
$(-2, 3.086)$ and $(2, 3.086)$. The solutions are

$$y = 2\cosh\frac{x}{2}, \qquad y = 1.404\cosh\frac{x}{1.404},$$

where the first is minimizing and the second is not.
The results of computation are presented below in
tables 1, 2, and 3. In these tables as well as in tables
4 and 5, $y$ denotes the exact solution, $y^{(0)}$ the initial
approximation, and $y^{(i)}$ the approximation at the $i$th
step. The symbol $c_i$ denotes the value of

$$c = \frac{1 + y'^2}{y} - y''$$

along the curve $y^{(i)}$. Along a solution of (105) we
have $c = 0$. The function $v_i$ found in tables 2 to 5 is
a solution of the equation $\delta c = 0$, which results from
taking the variation of the equation $c_i = 0$. The
evaluation of $v_i$ can be arranged to be a by-product
of the computation of $y^{(i+1)}$. It follows from the
theory of conjugate points that $y$ will be a minimizing
arc if for $c_i$ sufficiently small $v_i$ does not vanish more
than once. If $v_i$ vanishes twice, $y$ is not minimizing.
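The conjugate-point test can be applied directly to the tabulated values of $v_1$. The sketch below counts sign changes in the $v_1$ columns of tables 2 and 3. The tables list $x = \pm$ value, so $v_1$ is taken to be even in $x$ and the half-range data are mirrored; the helper names are illustrative.

```python
def count_sign_changes(values):
    """Number of sign changes in a sampled function (zeros between samples)."""
    signs = [v > 0 for v in values if v != 0]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)

def mirror(half):
    """Extend samples given at x = 0, .2, ..., 2 to x = -2, ..., 2 by evenness."""
    return half[:0:-1] + half

# v_1 at x = 0, .2, ..., 2.0, copied from tables 2 and 3.
v1_table2 = [1.000, .995, .981, .957, .923, .877, .818, .745, .656, .548, .419]
v1_table3 = [1.000, .989, .957, .900, .816, .697, .542, .337, .073, -.265, -.693]

zeros_table2 = count_sign_changes(mirror(v1_table2))
zeros_table3 = count_sign_changes(mirror(v1_table3))
```

In table 2 the function $v_1$ never changes sign, consistent with $y = 2\cosh(x/2)$ being minimizing. In table 3 it changes sign between 1.6 and 1.8 on each side, so it vanishes twice on $-2 \le x \le 2$, and the catenary with $b = 1.404$ is not minimizing.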



It is of interest to note that convergence to a solu- 
tion of the Euler equations that was not a minimum 
was obtained. This shows the necessity of making 
additional assumptions (as was done in section 11) 
in order to insure convergence to a minimum. 

Example II:

The points $(x_1, y_1)$, $(x_2, y_2)$ are, respectively,
$(0, 2.5894)$ and $(5, 5.9284)$. The solutions are

$$y = 2\cosh\frac{x - 1.5}{2}, \qquad y = 1.8883\cosh\frac{x - 1.5806}{1.8883}.$$

The first arc is minimizing and the second is not.
It will be noticed that although the two solutions
are quite close to each other convergence to each one
was obtained.

Example III:

The points $(x_1, y_1)$ and $(x_2, y_2)$ are, respectively,
$(0, 2)$, $(5, 3)$. In this case there is no solution. After
an initial estimate consisting of the line joining $(0, 2)$
and $(5, 3)$, the second iteration resulted in points
$(x, y)$ lying outside of the domain in which $y > 0$.



The material in this section was supplied to the 
author by M. R. Hestenes, under whose auspices the 
computations were carried out. 



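Table 1

$y = 2\cosh(x/2)$, $y^{(0)} = 3.086$ $(-2 \le x \le 2)$

| $x$ | $y^{(0)}$ | $y^{(1)}$ | $y^{(2)}$ | $y^{(3)}$ | $y^{(4)}$ | $y$ | $c_0$ | $c_1$ | $c_2$ | $c_3$ | $c_4$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3.086 | 2.438 | 2.132 | 2.016 | 1.994 | 2.000 | .324 | .086 | .018 | .002 | .000 |
| ±.2 | 3.086 | 2.447 | 2.140 | 2.026 | 2.004 | 2.010 | .324 | .087 | .019 | .002 | .000 |
| ±.4 | 3.086 | 2.464 | 2.167 | 2.056 | 2.034 | 2.040 | .324 | .087 | .020 | .003 | .000 |
| ±.6 | 3.086 | 2.497 | 2.213 | 2.106 | 2.085 | 2.090 | .324 | .092 | .021 | .003 | .000 |
| ±.8 | 3.086 | 2.542 | 2.277 | 2.177 | 2.157 | 2.162 | .324 | .096 | .023 | .003 | .000 |
| ±1.0 | 3.086 | 2.600 | 2.360 | 2.269 | 2.251 | 2.255 | .324 | .101 | .025 | .004 | .000 |
| ±1.2 | 3.086 | 2.672 | 2.463 | 2.383 | 2.367 | 2.371 | .324 | .107 | .028 | .004 | .000 |
| ±1.4 | 3.086 | 2.756 | 2.586 | 2.320 | 2.007 | 2.510 | .324 | .114 | .031 | .005 | .000 |
| ±1.6 | 3.086 | 2.853 | 2.730 | 2.682 | 2.673 | 2.675 | .324 | .121 | .034 | .006 | .000 |
| ±1.8 | 3.086 | 2.963 | 2.897 | 2.870 | 2.865 | 2.866 | .324 | .128 | .038 | .006 | .000 |
| ±2.0 | 3.086 | 3.086 | 3.086 | 3.086 | 3.086 | 3.086 | .324 | .136 | .042 | .007 | .000 |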



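Table 2

$y = 2\cosh(x/2)$, $y^{(0)} = 1.886 + .30x^2$ $(-2 \le x \le 2)$

| $x$ | $y^{(0)}$ | $y^{(1)}$ | $y^{(2)}$ | $y$ | $c_0$ | $c_1$ | $c_2$ | $v_1$ |
|---|---|---|---|---|---|---|---|---|
| 0 | 1.886 | 2.053 | 2.003 | 2.000 | -.070 | .010 | .001 | 1.000 |
| ±.2 | 1.898 | 2.063 | 2.013 | 2.010 | -.066 | .010 | .001 | .995 |
| ±.4 | 1.934 | 2.092 | 2.043 | 2.040 | -.053 | .010 | .001 | .981 |
| ±.6 | 1.994 | 2.140 | 2.093 | 2.091 | -.034 | .009 | .001 | .957 |
| ±.8 | 2.078 | 2.208 | 2.165 | 2.162 | -.008 | .009 | .001 | .923 |
| ±1.0 | 2.186 | 2.297 | 2.258 | 2.255 | .022 | .008 | .001 | .877 |
| ±1.2 | 2.318 | 2.407 | 2.373 | 2.371 | .055 | .007 | .001 | .818 |
| ±1.4 | 2.474 | 2.540 | 2.512 | 2.510 | .089 | .006 | .001 | .745 |
| ±1.6 | 2.654 | 2.696 | 2.676 | 2.675 | .124 | .005 | .001 | .656 |
| ±1.8 | 2.858 | 2.878 | 2.867 | 2.866 | .158 | .003 | .001 | .548 |
| ±2.0 | 3.086 | 3.086 | 3.086 | 3.086 | .191 | .002 | .001 | .419 |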
Table 3

$y = 1.404\cosh(x/1.404)$, $y^{(0)} = 1.286 + .45x^2$ $(-2 \le x \le 2)$

| $x$ | $y^{(0)}$ | $y^{(1)}$ | $y^{(2)}$ | $y$ | $c_0$ | $c_1$ | $c_2$ | $v_1$ |
|---|---|---|---|---|---|---|---|---|
| 0 | 1.286 | 1.367 | 1.407 | 1.404 | -.123 | .010 | .000 | 1.000 |
| ±.2 | 1.304 | 1.381 | 1.421 | 1.418 | -.108 | .010 | .000 | .989 |
| ±.4 | 1.358 | 1.425 | 1.464 | 1.462 | -.068 | .010 | .000 | .957 |
| ±.6 | 1.448 | 1.499 | 1.537 | 1.534 | -.008 | .009 | .000 | .900 |
| ±.8 | 1.574 | 1.604 | 1.640 | 1.638 | .065 | .008 | .000 | .816 |
| ±1.0 | 1.736 | 1.744 | 1.778 | 1.776 | .143 | .007 | .001 | .697 |
| ±1.2 | 1.934 | 1.921 | 1.951 | 1.949 | .220 | .006 | .001 | .542 |
| ±1.4 | 2.168 | 2.138 | 2.163 | 2.162 | .293 | .005 | .001 | .337 |
| ±1.6 | 2.438 | 2.401 | 2.420 | 2.419 | .361 | .006 | .001 | .073 |
| ±1.8 | 2.744 | 2.715 | 2.725 | 2.725 | .421 | .009 | .001 | -.265 |
| ±2.0 | 3.086 | 3.086 | 3.086 | 3.086 | .474 | .017 | .001 | -.693 |



Table 4

$y = 2\cosh\dfrac{x - 1.5}{2}$, $y^{(0)} = 2.5894 - .8322x + .3x^2$ $(0 \le x \le 5)$

| $x$ | $y^{(0)}$ | $y^{(1)}$ | $y^{(2)}$ | $y^{(3)}$ | $y$ | $c_0$ | $c_1$ | $c_2$ | $c_3$ | $v_2$ |
|---|---|---|---|---|---|---|---|---|---|---|
| .0 | 2.589 | 2.589 | 2.589 | 2.589 | 2.589 | .054 | .012 | .004 | .001 | .000 |
| .2 | 2.435 | 2.468 | 2.449 | 2.439 | 2.438 | .019 | .011 | .004 | .001 | .189 |
| .4 | 2.304 | 2.368 | 2.331 | 2.313 | 2.310 | -.014 | .011 | .004 | .001 | .357 |
| .6 | 2.198 | 2.287 | 2.235 | 2.210 | 2.206 | -.044 | .011 | .003 | .001 | .507 |
| .8 | 2.116 | 2.226 | 2.160 | 2.128 | 2.124 | -.069 | .010 | .003 | .001 | .642 |
| 1.0 | 2.057 | 2.184 | 2.106 | 2.068 | 2.062 | -.088 | .010 | .003 | .001 | .762 |
| 1.2 | 2.023 | 2.160 | 2.072 | 2.029 | 2.022 | -.099 | .010 | .003 | .001 | .871 |
| 1.4 | 2.012 | 2.154 | 2.057 | 2.010 | 2.002 | -.103 | .005 | .002 | .000 | .970 |
| 1.6 | 2.026 | 2.167 | 2.061 | 2.010 | 2.002 | -.098 | .009 | .002 | .000 | 1.058 |
| 1.8 | 2.063 | 2.198 | 2.085 | 2.031 | 2.022 | -.086 | .009 | .002 | .000 | 1.138 |
| 2.0 | 2.125 | 2.248 | 2.129 | 2.072 | 2.062 | -.066 | .009 | .002 | .000 | 1.209 |
| 2.2 | 2.211 | 2.316 | 2.192 | 2.133 | 2.124 | -.040 | .009 | .002 | .000 | 1.272 |
| 2.4 | 2.320 | 2.404 | 2.277 | 2.215 | 2.206 | -.010 | .009 | .002 | .000 | 1.328 |
| 2.6 | 2.454 | 2.513 | 2.383 | 2.320 | 2.310 | .023 | .009 | .002 | .000 | 1.374 |
| 2.8 | 2.611 | 2.643 | 2.511 | 2.448 | 2.438 | .058 | .008 | .002 | .000 | 1.412 |
| 3.0 | 2.793 | 2.795 | 2.663 | 2.599 | 2.590 | .093 | .008 | .002 | .000 | 1.441 |
| 3.2 | 2.998 | 2.970 | 2.840 | 2.777 | 2.768 | .128 | .008 | .002 | .000 | 1.459 |
| 3.4 | 3.221 | 3.171 | 3.013 | 2.982 | 2.972 | .162 | .007 | .002 | .001 | 1.464 |
| 3.6 | 3.482 | 3.398 | 3.276 | 3.217 | 3.208 | .194 | .007 | .002 | .000 | 1.456 |
| 3.8 | 3.759 | 3.654 | 3.539 | 3.483 | 3.474 | .224 | .007 | .002 | .001 | 1.431 |
| 4.0 | 4.061 | 3.941 | 3.836 | 3.785 | 3.776 | .252 | .008 | .003 | .001 | 1.388 |
| 4.2 | 4.386 | 4.261 | 4.169 | 4.124 | 4.116 | .277 | .009 | .003 | .001 | 1.323 |
| 4.4 | 4.736 | 4.616 | 4.541 | 4.503 | 4.498 | .300 | .011 | .004 | .001 | 1.232 |
| 4.6 | 5.109 | 5.011 | 4.955 | 4.928 | 4.924 | .323 | .013 | .004 | .001 | 1.111 |
| 4.8 | 5.507 | 5.447 | 5.416 | 5.401 | 5.400 | .343 | .017 | .004 | .001 | .956 |
| 5.0 | 5.928 | 5.928 | 5.928 | 5.928 | 5.928 | .361 | .022 | .005 | .001 | .760 |
Table 5

$y = 1.888\cosh\dfrac{x - 1.5806}{1.8883}$, $y^{(0)} = 2.5894 - 1.2322x + .38x^2$ $(0 \le x \le 5)$

| $x$ | $y^{(0)}$ | $y^{(1)}$ | $y^{(2)}$ | $y^{(3)}$ | $y$ | $c_0$ | $c_1$ | $c_2$ | $c_3$ | $v_2$ |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2.589 | 2.589 | 2.589 | 2.589 | 2.589 | .212 | .002 | .009 | .002 | .000 |
| .2 | 2.358 | 2.375 | 2.403 | 2.414 | 2.416 | .159 | -.008 | .008 | .001 | .185 |
| .4 | 2.157 | 2.194 | 2.245 | 2.267 | 2.270 | .103 | -.003 | .007 | .001 | .344 |
| .6 | 1.987 | 2.044 | 2.114 | 2.145 | 2.149 | .046 | .003 | .006 | .001 | .479 |
| .8 | 1.847 | 1.923 | 2.009 | 2.047 | 2.052 | -.008 | .008 | .005 | .001 | .595 |
| 1.0 | 1.737 | 1.828 | 1.927 | 1.927 | 1.978 | -.056 | .012 | .005 | .001 | .693 |
| 1.2 | 1.658 | 1.758 | 1.869 | 1.920 | 1.927 | -.095 | .016 | .004 | .001 | .777 |
| 1.4 | 1.609 | 1.712 | 1.833 | 1.889 | 1.897 | -.121 | .019 | .003 | .001 | .847 |
| 1.6 | 1.591 | 1.690 | 1.819 | 1.880 | 1.888 | -.131 | .021 | .003 | .001 | .905 |
| 1.8 | 1.603 | 1.690 | 1.827 | 1.892 | 1.901 | -.124 | .022 | .002 | .001 | .953 |
| 2.0 | 1.645 | 1.714 | 1.857 | 1.926 | 1.935 | -.102 | .022 | .002 | .001 | .989 |
| 2.2 | 1.718 | 1.761 | 1.910 | 1.981 | 1.990 | -.065 | .021 | .002 | .001 | 1.015 |
| 2.4 | 1.821 | 1.831 | 1.986 | 2.059 | 2.068 | -.018 | .019 | .002 | .001 | 1.030 |
| 2.6 | 1.954 | 1.927 | 2.085 | 2.160 | 2.170 | .035 | .017 | .002 | .001 | 1.034 |
| 2.8 | 2.118 | 2.049 | 2.210 | 2.285 | 2.295 | .091 | .015 | .002 | .001 | 1.024 |
| 3.0 | 2.313 | 2.198 | 2.361 | 2.437 | 2.447 | .147 | .012 | .003 | .001 | 1.000 |
| 3.2 | 2.538 | 2.378 | 2.541 | 2.616 | 2.626 | .201 | .009 | .003 | .001 | .960 |
| 3.4 | 2.793 | 2.590 | 2.751 | 2.824 | 2.834 | .252 | .006 | .004 | .001 | .900 |
| 3.6 | 3.078 | 2.839 | 2.995 | 3.065 | 3.074 | .300 | .005 | .004 | .001 | .818 |
| 3.8 | 3.394 | 3.126 | 3.274 | 3.340 | 3.349 | .342 | .004 | .005 | .001 | .710 |


4.0 


3. 741 


3. 456 


3.593 